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"The Time may not be very remote when 

it will be understood that for complete 
initiation as an efficient citizen of 

one of the new great complex world wide 
states that are now developing, it is as 
necessary to be able to compute, to think 
in averages and maxima and minima, as it is 
now to be able to read and to write." 


--H. G. Wells, 
Mankind in the Making. 
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PREFACE 


The success of statistical techniques in improving the 
quality of manufactured goods has led to the adoption of these 
technigues into non-manufacturing fields. Today these highly 
developed mathematical techniques are providing case histories 
of successes in overcoming problems concerned with the 
improvement of goods, services, and performance through better 
control methods, and through analysis of data in decision 
making. Statistics is commonplace in the fields of stock 
market, advertising, sales surveys, and accounting. The statis 
tical methods based on the law of probability, which were 
successful in controlling war-production quality, are now being 
applied to service, staff, and administrative activities. 

Although there are many favorable reports and case 
histories to substantiate the value of the application of 
statistical technique to decision making there are many who 
do not accept these methods. Statistics are "dry" and there is 
mystery in figures and formula. There are many who distrust 
figure people. Statistics is a scare word and hence must be 
sold to be accented. 

It is the purpose of this paper to provide, in as far 
as is possible in dealing with a subject of this sort, a 
nontechnical discussion of these statistical technicues. It 
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is not the purpose here to do more than merely scratch the 
surface of statistical theory to provide only what is necessary 
to demonstrate the application of statistical method in solving 
rather routine problems. It is not necessary to become a 
skilled mathematician, the technical phases of statistical 
method can be delegated to the technicians--if it is clearly 
understood at the management level what the objectives are 

and if proper support is given the staff. Although the 
comptroller need not know detailed technical applications, 

he should be interested and have developed some feel and 
appreciation for such methods. The use of statistical and 
mathematical method is in reality a front-office tool and the 
higher one ascends within the management structure the more 


important it becomes as a management technique of analysis 


and control. 
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CHAPTER I 
STATISTICS AND THE COMPTROLLER 


Introduction 

Statistics is the offspring of a varied ancestry. As 
a result of greed the ancient kings enumerated their people 
for the purpose of taxation; the panic of an English Sovereign 
during the London plague; the cupidity of professional gamb- 
lers; the scientific ardor of the psycho-physicists; the labors 
of mathematicians and astronomers and physicists; students of 
social phenomena, biologists, and educators planning a new 
science of education; from these statistics has descended. 

Today the field of statistics has many followers, yet 
there are many more who feel that any one who deals with 
numbers, and hence statistics, cannot be trusted. Even though 
there is a strong negative view toward the concepts of statis- 
tics, it has nevertheless become firmly established as an 
indispensable tool of the comptroller and scientific manage- 
ment. 

Statistics has become a basic tool of decision making 
and is a basis for many important management functions such as 


market and scientific research, budgeting and forecasting, 


ET 


Helen M. Walker, Studies in the History of Statistical 
Method (2d printing; Baltimore, Md.: The Williams and 


Wilkins Company, January 1931), p. 1. 
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2 
purchasing, production planning, quality control, engineering, 
etc. Of these broad applications the military comptroller is 
vitally interested in reliable methods and techniques for 
budget prediction and forecasting. 

Many in the field of comptrollership do not understand 
how this vital tool can be used to their advantage; there is 
inadequate understanding of this powerful tool even among many 
persons engaged in the fields where statistics has its greatest 
. application. 

One might ask why this new technique of scientific 
management is not better understood and accepted. Some educa- 
tors give as the paramount resson the fact that statistics was 
a rare course in our colleges during the college years of most 
of our key executives today. And even today one can still find 
many schools of business administration of some of our leading 
cOlleges that do not have an adequate statistical curriculum. 
interest? 


Wht Is Statistics and why the Growin 





Most educators wiil define statistics as that branch 
of applied mathematics which deais with the collection, tabu- 
lation, presenting, analyzing, and interpreting quantitative 
data. 


There are many reasons for the growing interest in 
the analysis of quantitative date. One is that the 
tools of analysis have become sharper as refinements 
have been made in statistical techniques. But the 
principal reasons for the increasing use of these 
methods are to be found in certain basic changes 
which have taken place in the American economy. 


First, there has been a rapid transition from the 
day of small local enterprise to an era of centralized 
large-scale production. As the size of the business 
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unit has increased, the importance of executive 
decisions has increased. Today, great corporations 
exceed in wealth the sovereign states which created 
them. Never before have business executives controlled 
so much wealth, nor have so many persons been dependent 
on a single enterprise, and never before have mistakes 
been so costly in terms of money and human suffering. 


The responsibilities of the modern executive have 
grown with the scale of his operations, but his ability, 
as an individual, to reach proper decisions has not 
kept pace. A. C. Nielsen, of the A. C. Nielsen Company 
of Chicago, is a marketing expert who employs quantitative 
data in analyzing problems of marketing for some of 
America's largest corporations. Always, before submitting 
an analysis of a particular problem to the executive who 
has ordered it, he asks the executive what he considers 
the answer to be. He has kept a record of the replies. 
According to Mr. Nielsen the executives have been wrong 
42 per cent of the time. 


Lacking objective statistical studies to guide then, 
the executives would have made almost as many wrong as 
right decisions. 


A second development in the American economy which 
has stimulated the use of quantitative data is the great 
variation which has appeared in economic activity as 
cyclical prosperities and depressions have followed one 
another in rapid sequence. One effect of the apparently 
increasing instability of the economy has been to create 
an urgent need in the fields of business and government 
for a type of analysis of economic conditions which may 
enable the executive to see a short distance into the 
future, to detect the shape of things to come shortly, 
at least, before they arrive with the devastating force 
of depression or the buoyant effect of prosperity. 


There has been a growing use of statistics in govern- 
ment as well as in business. As the government continued to 
grow in size and to assume many functions statistical tech- 
niques were increasingly emphasized. The complexities of 
planning for the economy as a whole, the obligations assumed 


by the various federal bureaus in administering the affairs of 





“william Addison Neiswanger, Elementary Statistical 
Methods (New York: Macmillan Company, 1951), v. 2. 
















maua = a iy ر‎ 
ami teme dem, ! 
de سم نم سه سا‎ 


a oar ume coe c ود‎ ee اس مو‎ 


mum REEL 


سر و 
١ Sz ia -‏ 













حا وم سو Dosen‏ 
















۰ 





pee Ir @ Jtetteee To جو سسو‎ ٢ ےہ موم م‎ 

×٠ مروووسوس۔٭+ .ہی‎ «d om موسو‎ ut مه‎ ‘lew wh سه‎ 
۱81۸ دنت اس ۱و۴‎ ques Mil be QU Aui date af com 
eee cum ee 
s om Dekh Lk لد‎ oe e e eon IR تود‎ Mle Ty 
۱-۴۰ uma vate 4) cda = e 








4 
industry and agriculture, and providing for the national 
security means that decisions made by government officials are 
of great importance in terms of money and human welfare. In 
this area it is obvious, mistakes can be most serious, and 
decisions based on personal direct observation are unlikely. 
The duties and responsibilities of the comptroller of 

the Navy stem directly from those specified in Title IV of the 
National Security Act Amendments of 1949. Title IV states 
among other things that the comptroller shall: 

Establish and supervise principles, policies, and 

procedures of organizational and administrative 


matters relating to the preparation of cost budgets, 
fiscal operation, capital property, accounting, and 


progress and statistical reporting. (italics mine.) 
The Secretary of the Navy in setting forth the basic 


concepts of comptrollership in the Navy emphasized the statis- 
tical function by requiring the comptroller to be responsible 


for “developing guides and criteria for the collection and 





coordination of atatistical Gata and preparing special statis- 


tics as required. (italics mine. )? 

The Controllers Institute's Committee on Ethics and 
Eligibility Standards emphasizes statistics as a Comptroller- 
ship function and states among other things that the controller 
is responsible for the compilation of statistical records as 


required. 


20 S., Congress, National Security Act Amendments of 
1949, Public Law 216, 8lst Congress, ist session, 1949, 
Title IV. 


AU, se, Department of the Navy, Secretary of the Navy 
Instruction 5400.4, November 18, 1955, p. l. 
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5 
At this point it might be well to emphasize this point 


that it is not the purpose of this paper to imply that the 
comptroller be a trained and accomplished statistician. The 
comptroller needs to make decisions rapidly, based upon timely 
and accurate data. It is only through the modern methods of 
statistics and applied mathematics that such 3 desire for data 
collection can be realized; thus it becomes important that the 
military comptroller while essentially a generalist and not a 
specialist, at least among line officers, become thoroughly 
familiar with the capabilities and relationships of statistic- 
fans and statistics in tne broad field of management. 

Management is concerned with what must be done, thus 
1t becomes necessary for the manager to make some sort of fore- 
cast of what events are likely to happen. Often this is 
accomplished by using past experience as a guide, or his 
intuition but, although these can at times be good guides, they 
can alse be Jisastroue mistakes which could have been avoided 
were more accuréte and timely data available and effectively 
used. 

To be effective management needs to know such things 
as the relationship existing between one set of events and 
another, and the trends established by these events, and causes 
of these events in order to add a degree of reliability to its 
forecasts. Usually no invariable relationships of cause and 
effect can be found in the pest or predicted of the future and 
s0 the manager has to deal in the realm of probabilities. This 


he is by training used to doing as a result of mostly dealing 


‘ 
ui toisemme یا سن‎ — id. 6 da de. 

M dem ili oa ADAM Ne a عد و ا عا عار مف‎ 
ماس,‎ 17+۸۸۸۲ ta Loscaee ay aa! © هسومااي ون‎ 

t -e man MAALU CBORD Uap we amy d 
۱م‎ mom وہ دوه جار بسسره ہہ‎ tt Amar رس‎ 
= seai کا مه‎ RHO دوه ۶م وو اس‎ he trans 
سب جه ملس بس لست ادد لل زب‎ oes bb 
+ مسوسطد نعل وہہ جين‎ elles ای ما‎ 
ا سوا‎ yeod al am del TA ,8 واا‎ 
n md tane o IA ICAO AI A ان‎ 
amanna ۳ a e ana: 
سس‎ nm i itin رو سس‎ a 
LL MNA LLL LALOM LLL oxi 
~us Âu me or لوون‎ ٤ وس ونو‎ deme Je ویج‎ 
elt e shine > pe smo Some enka ye مسد« تسم‎ 
eu s e dd 4 ۰ gem cesi qua ádéé ناه كف‎ Lets 
' uf daga ماج‎ ciii ule د اانه د حلاصا‎ 
ier ہم‎ ۱۰٦۸۸۸۰ GAGE انح هرم پممووصصنه اسا استنږ‎ 
سنہ ۔‎ 














LLL LAM E iE 
٠ مہ‎ ee i aot A AO AA MAR A 
e ATA AED روس جج مہ سسوماه سلا المومط بي‎ 
ridindel Ve serrat 4 ote زم‎ web له‎ doers Aker de 
U--—, *u end tw D ALA ERA al ci 
اص ف مت الہ زج دجہت‎ 
٦۰ ameye 1^ مس ند مود م عم مر‎ 
(Frey mw 6 











6 
with human relationships and individual persons, whose 
individual actions cannot be predicted with any degree of 
certainty. 

There are still those in the field of management, 
however, who are afraid of figures. They feel that they can 
see and hear what is going on and do not need figures to tell 
them. But it is impossible to be everywhere at the same time 
and reliance must be placed, on occasion, on the observation 
and judgment of others--within these observations is necessari- 
ly the element of measurement and measurement cannot be possible 
without the use of numbers. 

The techniques of drawing conclusions from numbers is 
where misunderstanding and harm are done. Intuition can pro- 
vide for better conclusions than those of the man who loves 
figures, but does not have the special knowledge of how to deal 
with them, and works out percentages and averages and then 
draws unjustifiable conclusicns from them.? 

It is etill too little realized that there exists 

a technique of dealing with figures in such a way as 
to find out with a very high degree of probability 
what sort of conclusions, if any, can validly be 


drawn from certain sets of figures with what degree 
of accuracy they can be drawn and to present those 
conclusions in a simple form. In this way not only 
is it possible to avoid drawing wholly unwarranted 
conclusions, but also at time to make some sense of 
a set of fisures which at first sight may appear to 
tell nothing. 





The Report of a Committee, for Submission to the 
Twenty-Fifth Anniversary Conference of Management Research 


Groups st Buxton, 1-4 June, 1951 (London: Management Research 
Groups, Manfield House), p. 6. 


bibid. 
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Tnese tecanigues of dealing with figures is the 
service that the trained statistician specializing in mathe- 
matical analysis can give to the comptroller or manager, and 
in subsequent chapters of this paper examples of these 
techniques will be presented. 

The basis of all statistics is counting or measurement 
and the statistician specializes in methods of countin 
or measurement, whether by so-calied complete counts, 
or censuses, or by counts or measurements of a sample of 
the whole, from which the totality can be estimated. To 


this end he uses a special set of applied mathematical 
techniques based on the theory of probability. 


Statistical studies are of two classes, enumerative 
and analytical.  Enumerative study determines how many 
of some classes of things exist at a particular time, 
but is not concerned with why they exist. An analytical 
study investigates patterns and variations either in one 
series of events over a period of time, or between differ- 
ent groups of things. It may or may not be possible as 
a result to make some assessment of causes, but if the 
results of the study are at all positive then scme 
causal relationship is assumed to exist. The routine 
calculation of enumerative data often forms the basis 
for analytical studies. 

It is important at this point that the statistician 
and the accountant not be confused--that a distinction be made 
between the two arts. In accounting the individual account or 
entry is of chief concern. It must be accurate and reliable 
and subject to audit. In statistics it is not necessary to 
have accounting accuracy for each individual item, but rather 
estimates based on sampling is the order of the day. Mathe- 
matical accuracy is still recuired in statistical work and it 
is not intended to imply otherwise, however, some sacrifice 
of reliability can be made of the complete count for a 


sampling study expertly done. 
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The unit loses its identity in a statistical measure 
such as an average or a total. It is the measure or total 
and the picture portrayed by these measures from which deci- 
sions are based and not from each individual unit. The 
statistician may say that a certain fact exists within a 
specified range such as the inventory of a specific item is 
not more than 1050 items, for example, nor less than 950 items. 
The accountant would be required to produce a complete count 
capable of audit. 

Conclusion 

In conclusion it is well at this point to summarize 
this discussion and to point out some valid reasons why the 
comptroller and management should acquire statistical know- 
ledge, and the relationship between management, the comptroller 
and the statistician. 

I think it is generally understood that the comptroller 
is to operate in a staff capacity, however, he may recommend 
to management, but he does not make managements decisions. 
Management looks upon the comptroller as a continuing source of 
ideas and advice on managerial problems. The comptroller must 
have had broad experience with operating programs and problems 
and general management responsibilities. He should have a keen 
analytical ability and be capable of making discriminating 
judgments, 

Since statistical methods are capable of application 
to all of the activities of an organization, and are a service 


function it is fitting then that this function should come 
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9 
under control of the comptroller or financial director, and 
this it does in most cases. 

There are many valid reasons for management to under- 
stand statistics. Such an understanding of course would 
promote a wider use of statistical data in management since 
. problems where statistics can be profitable would be quickly 
recognized. 

Communication with the professional statistician 
would be improved by a better understanding of the art by 
management. Albert Einstein once said, "The formulation of 
the problem is often more essential than the solution." With 
a statistical background management can understand the problems 
involved in fact-finding and thus can avoid misunderstanding 
in research projects conducted in the organization and assure 
effective use of the complied data. 

Management must understand that the science of 
statistics is not infallible and thet there is risk involved 
in decision making based on statistical conclusions. Most 
statistical decisions are based on samples and this involves 
risk. A statistical minded management should understand the 
"risk" involved and the “confidence” in statistical decision 
making. 

Of course a better understanding of statistical 
methods serves to emphasize that good effective statistical 
studies require a trained staff of professional statisticians. 
This will yield valid results with a minimum of cost. 


Statistical functions cannot be assigned to the budget analyst, 
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10 
clerk or bookkeeper in the office or the foreman or 
technician in the plant. 

And finally management will become a more confident 
administrator and enjoy peace of mind, insofar as administra- 
tive matters are concerned, by knowing the basic elements of 
statistics. 

More important he will make less mistakes and will 
be better able to control the mistakes of others in his 


organization. 





CHAPTER II 
HISTORY OF STATISTICAL METHOD 


The Normal Curve 


Origin in the Theory of Probability 
The history and development of statistical method is 


a colorful and interesting study. The statistical methods of 
today are based on the science of probability. The early 
scientists and mathematicians were aware that certain natural 
activities occur in conformity. The early observations of 
Aristotle and astronomers in establishing fundamental physical 
laws of the universe, regimented certain observations in the 
field of science. The ability to predict happenings and 
behavior on the basis of new found laws provided the author- 
itative position of charlatans, mystics, magicians and gamblers. 
Of course at this stage of development it was impossible to 
predict the behavior of certain factors which were caused by 
chance alone, however, this did not discourage or weaken the 
position of these early philosophers and scientists. The real 
beginning of the science of probability did not occur until 


1 


the early part of the 17th century. The exact date for the 


lRobert Kirk Mueller, Effective Management Through 
Probability Controls (New York: Funk and Wagnalls, 1950), 
p. 62. 
11 
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12 
discovery of the normal curve (fig. 1 ch. III) which had its 
beginning and discovery from the theory of probability was 
November le, 1755. 

There were some faint traces, of no more than six or 
eight problems, that intimated the idea of probability before 
the year 1600. These traces appeared in the Orient. One 
example occurred when the Chinese writer Yuan Yuan criticized 
a still older writer, Sun-Tze for tne following reason: 

"Many unnecessary details appear in his works on mathematics, 
such as a certain absurd problem, which surely cannot be 
attributed to him, on the probability that an expected child 
will turn out to be a boy or to be a girl."? 

In Europe the really first references to the theory of 
probability is a statement in a commentary (Venice, 1477) on 
Dante's Divine Comedy concerning the different throws which 
can be made with three dice. The first writer to discuss a 
problem concerning gambling in a mathematical paper was Luca 


Pacioli in his Suma (1494). In this paper he gave the first 





version of the problem concerning the equitable division of the 
stakes between two players of unequal skill when the game is 
interrupted before its conclusion. This problem was repeated 
and amplified in all the works on probability for two hundred 
years. Astronomers played a conspicuous role in developing the 
mathematical theory of probability two centuries later--it is 
worthy of note that both Kepler and Galileo made brief 





“Helen M. Walker, Studies in the History of Statistical 
Method (Baltimore, Md.: The Williams and Wilkins Company, 


January 1931), p. 1, ff. 
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reference to the subject of chance. 

The first real scientific works on probability began 
in the 17th century. The foundations were laid by the two 
most distinguished mathematicians of that time, Blaise Pascal 
and Pierre de Fermat. The theory developed when the Chevalier 
de mMéré, a gambler having unusual ability, even for mathematics, 
proposed certain questions to Fascal. Among these questions 


۹ 


was the celebrated "Problem of Pointe’ concerning the division 
of stakes between two players who separate without completing 
their game. Pascal and Fermat exchanged numerous letters on 
this subject during the year 1654, and in the course of this 
correspondence they generalized the problem more and more until 
at its close, that which had first been a source of perplexity 
to a gambler had been elevated to an important mathematical 
concept. 

Within the next few years several problems relative to 
the outcome of games of chance were proposed end solved in the 
journals of various learned societies by Jacques Bernaulli, 
Montmort, De Moivre, Arbuthnot, Francis Roberts and others. 
The Discovery of the Normal Curve 

The normal curve” was discovered by Abraham De Moivre 


(1667-1754) as a result of his intensive study to his work and 


the Normal Curve: A symmetrical bell shaped-curve 
possessing certain stetietical characteristics. See 
discussion Chapter III. 
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his applications of the binomial theorem” led him to discover 
a formula for the ratio between the miádle term and the sum of 
all the terms of (led). Many of the recent treatises on 
probability and sampling approach the matter by a method 
similar to De Moivre's use of the binomial expansion.” 

This formula was first published November 12, 1733. 
In 1750, De Moivre introduced his Miscellanea Analytica and 


three years later he presented privately to a few friends a 
short seven page paper entitled Approximatis ad Summen Termin- 
orum Binomie a+b” 1n Seriem Expansi. In this obscure treatise 


on abstract mathematics, written in Latin, two centuries ago, 
and believed by De Molvre to have no practical implications 
Outside the realm of games of chance, we have the first formu- 
lation of the concept of the law of errors. He did not 
realize that there would come a time when this theorem would 
powerfully affect the thinking of the world on all its social 
problems, shape the policy of educators in the schools, and 
aid in thousands of investigations in science whose very names 
were then unknown. 

Later De Moivre announced the formula for the curve of 


error, and connected his theory of probability with theology, 


+ Binomial Theorem: The theorem by means of which a 
binomial may be raised to any power without performing the 
multiplications. A binomial is an expression consisting of 
two terms connected by a plus or a minus sign. A binomial 
distribution is a distribution where there are only two 
alternatives. For example in sampling light bulbs, they are 
either good or bad. 


Binomial Expansion: Raising the binomial to a given 


power. 
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urging that this tendency of events to conform to law argues 

a Great First Cause. 
And thus in all cases it will be found, that although 
chance produces irregularities, still the odds will be 
infinitely great,that in the process of time, those 
irregularities will bear no proportion to the recurrency 
of that order which naturally results from Original 
Design. . . . Again, as it is thus demonstrable that 
there are, in the constitution of things, certain laws 
according to which events happen, it is no less evident 
from observation, that these laws serve to wise, useful 
and beneficient purposes, to preserve the stedfast 
Order of the Universe, to propagate the several Species 
of Beings, and furnish to the sentient king such degrees 
of happiness as are suited to their state. 

in these later writings, De Moivre is eager to free 
the subject of probability from its connection with gambling, 
and establish a theological doctrine of a divine order working 
through human affairs and exhibiting itself in the regularity 
of statistical ratios. 

One of the first attempts to reduce the theory of 
probability to rules of thumb which might be followed by per- 
sons having no command of mathematics beyond simple arithmetic 
was made by Augustus De Morgan (1806-1871) in An Essay on 
Probabilities and on Their Application to Life Contingencies 
and Insurance Offices. De Morgan was brilliant and somewhat 
erratic; he wrote on a wide range of mathematical topics. He 
was the greatest English authority of his day on the theory 
of probability. He stated in his Essay on Probabilities that 
heretofore the work in probabilities has been possible only to 


the expert mathematician, but that 
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those who already admit that the theory of probabilities 
is a desirable study, must of course allow that persons 
who cannot pay much attention to mathematics, are 
benefited by the possession of rules which will enable 
them to obtain at least the results of complicated 
problems, and which will therefore, permit them to 
extend their inquiries further than a few simple cases 
connected with gambling. 
His book was really a handbook, full of practical advice and 
sufficient practical rules for the treatment of a set of 
observations. He advised against unwarranted conclusions from 
@ situation. 

Later De Morgan wrote other essays on the Theory of 
Probability and in these he discussed the theory of least 
squares, / the application of the calculus of definite 
integrals, and some complicated problems in probability. One 
portion of his essay is of particular interest because it 
indicates the direction and thinking of scientific men of that 
day. This particular section discusses the probabilities of 
testimony, of miracles, of moral questions, of decisions of a 
jury. He went so far as to suggest that an investigation into 
the validity of the English practice of requiring a unanimous 
decision from a jury might be made by studying five hundred 
trials in which the jury have delivered their verdict at once 


to see if they show a smaller percentage of error than five 


computed lines may be passed through‏ )اا 
the data, or scatter points, figures 2, 3, 4, 5, chapter III‏ 
which are called regression lines, or lines of average relation-‏ 
ship, because they reveal the typical change in the dependent‏ 
variable Y which has, in the past, accompanied a given change‏ 
in the variable plotted on the X axis. This average relation-‏ 
ship may be determined mathematically by the method of‏ 


least squares. 
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TT 
hundred trials in which the jury deliberated for two hours or 
more. Other writers later attempted to assign numerical 
values to the probabilities of the accuracy and sincerity of 
witnesses, and by elaborate mathematical processes attempted 
to reach numerical measures of the probability that a given 
defendant is guilty or innocent. 

Professor Karl Pearson is generally regarded as having 
accomplished the most significant work in the development of 
moments. The analogy between the statistical function obtained 
by taking the sum of the nth powers of the deviations in any 
given distribution, and the mechanical concept of a moment, 
had been noticed by other writers before Pearson, but none of 
them had perceived in that function the powerful tool which 
he developed.? 

Pearson studied extensively the effect of sampling 
upon the moments of a frequency distribution. Pearson and his 
associates set down the general method by which the sampling 
error of any moment might be studied. Among the formulas 
derived were for the standard deviation,’ the standard deviation 


of a coefficient of correlation, 1° the correlation between 





sh‏ وربا 


ÜIbid., p. T6, ff. 


Standard deviation--the most reliable measure of 
variation. Often referred to as the root-mean-square. 


IO correlation coefficlent--an index measuring the extent 
of relationship between two variables. It ranges from -1 
(a perfect negative relationship) to 1 (a perfect 586 
relationship). 
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18 
sampling errors in the standard deviations of two correlated 
traits, and the correlation between sempling errors in a 
coefficient of correlation and in a standard deviation. The 
moments are used in curve fitting and from the moments are 
derived the paraneters? in the Pearson system of curves. 
Percentiles 

The discovery of the importance of the median as a 
measure of central tendency was made about seventy five years 
ago. There were, however, some earlier writers who had 
conceived this measure but who did not place great emphasis 
upon 1t, 1? 

In 1816 Gauss suggested a simple method of computing 
the probable error. He said "let all the errors be arranged 
in order of size without regard to sign, and take the middle 
one if the number is odd, or the mean of the two middlemost if 
the number is even." This value, which he called merely M, 
is thus the median of the absolute value of the errors, and 
similar to, though not necessarily identical with Galton's 
quartile deviation. 

Encke, Quetelet, Fechner, and Galton made similar 
studies. Sir Francis Galton's, (1822-1911), first statistical 
work was a study of Hereditary Genius (1869) which led him to 


feel the need for a satisfactory method of reporting differences 


in achievement and intellectual ability. The following are 


11 و موو وووروح‎ characteristic in a population. For 
example, the true proportion of defectives in a lot of 
50,000 light bulbs. 
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some cuotes from his book: 


The theory of hereditary genius, though usually 
scouted, has been advocated by a few writers in the 
past as well as in modern times. Put I may clair to 
be the first to treat the subject in a statistical 
manner, to arrive at numerical results, and to introduce 
the "law of deviation from the average'into discussions 
on heredity. .. . The range of mental powers between 
« o . the greatest and least of English intellects, 
is enormous. There is & continuity of natural ability 
reaching from one knows not what height, and descending 
to one can hardly say whet depth. . . . I propose in 
this chapter to range men according to their natural 
ebilities, putting them into classes seperated by equal 
degrees of merit, and to show the relative number of 
individuals included in the severel 01858865. ه ه‎ 
The method I shall employ for discovering all this, 
is en epplication of the very curious theoretical 
law of "deviation from and average." First I will 
explain the law and then I will show that the productions 
of natural intellectual gifts come justly within its 
scope. .. . Here we arrive at the undesirable, but 
unexpected conclusion, that eminently gifted men are 
raised as much above mediocrity as idiots are depressed 
below it; a fact that is calculated to considerably 
enlarge our ideas of the enormous * 2 es of 
intellectual gifts between man and man. 


The idea of grades has ite first published statement 
in this book Galton placed the ablest men in each million in 
the highest grade, and the most stupid in the lowest grade, and 
then divided the remaining 99,998 into 14 classes, the average 
ability of each being separated from that of its neighbors by 
equal grades, thus forming a table to be applied to special or 
general ability. It would be true for any type of examination 
whether held in painting, in music, or in statesmanship. The 
proportion between the different classes would be identical in 
all these cases, although the classes are made up of different 
individuals. It is clear that Galton was moving rapidly in 


the direction of a percentile scale, although even the concept 
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20 
of the median is not explicit as yet. 

Galton later develonved percentile scales in assigning 
marks of excellence on test scores he arranged all the members 
of the group in an order of merit. One of Galton's earliest 
attempts to do this scientifically resulted in a "scale of 
merit" for the men who obtain mathematical honors at Cambridge. 

Galton's general method of grades and deviates is 
widely used in assigning, class marks, and in constructing 
educational tests and scales. It is fundamentally the method 
employed for the construction of a T-scale.24 

Correlation 
It would be rather difficult to condense a complete 
history of correlation in this paper. It is the purpose here 
to present only 2 brief account of the develonment of 
correlation. 

Correlation theory developed historically from the 
theory of probability, and it is generally understood that 
correlation was the unique discovery of Sir Francis Galton, 
made early in the last quarter of the nineteenth century, and 
greatly elaborated and refined by Pearson, Edgeworth, and 
Weldon. Many mathematicians came close to the discovery of 
correlation. Several Freneh and German astronomers, physicists 
and mathematicians approached the problem from the point of 


view of mathematical analysis, but failed to see the practical 


amie 1 





D theoretical distribution that goes by the name of 
Student-T distribution, develored by W. S. Gosset who 


published his work under the pen name of Student. 
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21 
significance of the formulas they derived. It has been said 
that this may be as a result of too much attention to theory 
and not enough toward empirical date, 5ط‎ 

In his Memories, Galton tells of the first time he 
recognized that regression would be the key to the problem of 
heredity (the concept of correlation developed from regres- 
sion). Galton diecovered that the laws of heredity were 
solely concerned with deviations expressed in statistical 
unite. 

Galton's earliest studies on resression were in 1875 
when he was conducting experiments with sweet-pea seeds to 
determine the law of inheritance of size. The question at 
this time in Galton's mind was: "How 1s it vossible for a 
whole population to remain alike in its features, as a whole, 
during many successive generation if the average produce of 
each couple resemble their parents?" From the crude data 
obtained and assembledthe law of rezression; the formula for 
the standard error of the estimate,lÓ and the idea that the 
various arrays in the correlation table hsd equal variability. 

In studying the inheritance of stature, Galton 

firet transmuted all female statures to male, multiplying 
them by the constant 1.08 (thie had the effect of 
equating means, but did not take into account the 


different variability in male and female statures). 
The average of the statures of the parents was then 
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16 
Standard error of estimate--a measure of the scatter 
about the line of regression. The standard deviation of the 
observed points about the line. 
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22 
termed the "nid-oerent;" the average of the statures 

of the offspring, the "mid-Fraternity.* From these 

he made "Tables of Stature," later called correlation 
tables. For each array and for the table as a whole 

he found the median. He also found © (semi-intercuartile 
range)17 used as an approximate value of the probable 
error, for the general population, for the mid perents, 
and for each *co-Fraternity,* a *co-Fraternity? being 

all of the adult sons and transmuted daughters of 2 

group of Mid-Parents who have the same stature (reckoned 
to the nearest inch). Examining these values of Q for 
the various Co-Fraternities, he then found them roughly 
the same, regardless of whether the mid-parents were tall 
or short. Examining the medians he found that “However 
paradoxical it may appear at first sight, it is theoretical- 
ly a necessary fact, and one that is clearly confirmed by 
observation, that the Stature of the adult offspring 

must on the whole be more mediocre than the statures of 
their parents." As the medians of the rows and columns 
proved to lie on approximately straight lines, he called 
the latter, lines of regression.l 


Because Galton used the variability in his case Q as the unit 
in which he expressed deviations, these lines of regression 
were symmetrically placed. In 1877 he called the slope of 
these lines r for reversion. In his paper "Regression Towards 
Mediocrity in Hereditary Stature" (1886) he used W, and ۷ 
“Correlations and their Measurement” (1888) he returned to r, 
which now stood for regression. 
Galton defines correlation as follows. 

Two variable organs are said to be co-related when 

the variation of one is accompanied on the average by 

more or less variation the other, and in the came 

direction. . . . The statures of kinsman are co- 

related variables; thus the stature of the father is 

correlated to that of the adult son, and the stature 


of the adult son to that of the father; the stature of 
the uncle to that of the adult nephew, and the stature 


= 





1 inter-quartile range--the middle 50% of the 
observations. 
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23 
of the adult nephew to that of the uncle, and so on. 1? 


Operations Research 

To give a complete account of the development of the 
technique of operations research would require several volumes, 
far more than the few paragraphs that are allotted to the 
subject in this paper. Ever since about the third century 
B. C. political and military leaders have consulted scientists 
for solutions to the problems of war. There are many instances 
On record where these non military specialists have been able 
to assist the military with new ideas about the machines, 
tactics and, strategies of war. 

Operations Research began to produce more and more case 
histories about the time of World War I as both England and the 
United States attempted to analyze military operations by 
using mathematical methods. F. W. Lanchester of England 
produced some pavers in 1914 on the relationships between 
victory, numerical superiority and the superiority of fire 
power. In America Thomas Edison made studies of antisubmarine 
warfare for the Naval consulting board which included the 
compilation of statistics to be used in determining the best 
methods for evading and for destroying submarines . 2° 

By 1939 at the outbreak of World War II in Europe there 


was the nucleus of a British operational research organization 





1 p. 106. 
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20 طوه وم‎ F, McCloskey and Florence N. Trefethen (ed.), 


Operations Research for Management (Baltimore, Md.: John 
Hopkins Press, 1954), p. 4, ff. 
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already in existence. ‘Some of the problems under study at 
this time were to assist military personnel with radiolocation 
and the integrating of the newly developing radar system of 
early warning against enemy air attack with the older system 
of operational control based principally on the observer corps, 
whose members were trained in the sighting, identification, 
and reporting of planes. Other phases of operations research 
consisted of an analysis of all phases of German air activity 
during night operations over Britain; problems concerning the 
detection of ships and submarines by the use of radar equip- 
ment in airplanes, and all phases of antisubmarine warfare. 

Studies of the data obtained from the systematic 
bombing raids by the Allies over Germany indicated that larger 
plane formations suffered smaller percentage losses. These 
findings resulted in the first 1000-plane R. A. F. raid over 
Germany. In the United States the Army Air Force and the Navy 
began work in the field of operations research in 1940 after 
Dr. James B. Connant, then chairman of the National Defense 
Research Committee, visited England and became aware of the 
techniques of operations research. As in Britain early opera- 
tions research for the Air Force grew up around problems arising 
from new radar equipment. Toward the end of the war in the 
Pacific, the U. S. Navy's operational research team undertook 
& crash project in operations research as a result of the 
onslaught of kamikaze attack against Allied ships. The question 
to be answered was: “Should the ship under attack maneuver 


violently to avoid being hit or keep straight in order to get 
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25 
better aim with its anti-aircraft guns?" The operations 
research team working on the problem obtained the records of 
477 attacks, including 172 hits and 27 sinkings. The con- 
clusions reached after analysis was that a large ship should 
maneuver violently, whereas a small ship should change course 
slowly. Recommendations were also made about how ships should 
turn to receive inevitable hits. Tabulations of succeeding 
attacks demonstrated the validity of the recommendations. 
Those ships under attack which observed the recommendations 
were hit 29 per cent of the time, whereas others were hit 47 
per cent of the time. 

Nonmilitary operations research is probably best 
exemplified by the work of Horace C. Levinson, one of the 
first proponents of operations research in the United States. 
He began his work in the 1920's after giving up a career in 
astronomy for another in merchandising where he applied the 
trained scientist's methods to the problems of his employer. 
His work included studies of customers’ buying habits, the 
effect of quick response advertising on sales, and the relation 
of neighborhood environment to types of articles sold. There 
were many others. The field of management consulting has much 
in common with operations research and had its beginning in the 
latter part of the 19th century, when such pioneers of 
scientific management as Taylor, Gantt and Emerson started 
their work. 

The study of operations research eventually found its 


way into the Universities. In 1948, the Massachusetts 
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Institute of Technology established, in collaboration with 
the Navy a course in non military applications of operations 
research. Similar courses were established at the University 
College, London in 1949 and Birmingham University in 1950. 
The Case Institute of Technology became the first institution 
of higher learning to offer a curriculun in operations research 
leading to a degree of Master of Science. Later Columbia and 
John Hopkins followed with courses in operations research. 

Today operations research appears to be in an 
intermediate stage of development. It has proven its usefulness 
by a history of many cases, however, it is still developing 
with such speed that even the most experienced among operations 
research workers hesitate to mark out all the problem areas 


thet are not susceptible to an operations research approach. 
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CHAPTER III 
STATISTICAL M<THODS AND CASE STUDIES 


Although most business executives have a reasonable 
background of mathematical training, the secrecy, mystery and 
terminology with which the statistician and technical man 
surround the subject of statistics are often as confusing as 
the White Queen's example of addition. 

"Can you do addition?" the White Queen asked (of 

Alice). "What's one and one and one and one and one and 
one and one and one and one and one?" 

"I don't , know," said Alice. "I lost count." 

"She can't do addition,” the Red Queen interrupted. ... 

Alice in Wonderland by Lewis Carroll 
You need not be a mathematician to understand the basic 
principles used by statisticians or to use a statistical 
approach to probleme in comptrollership and business problems 
in general; in fact addition, subtraction, long division, and 
an oceagional square root thrown in is about all that is needed 
for an executive to become reasonably familiar with statistical 
techniques. 

It is impossible within the scope of this paper to 
present in detail the many statistical methods used by 


statisticians in obtaining solutions to the many different 





robert Kirk Mueller, Effective Management Through 


Probability Controls (New York: Funk and Wagnalls, 1950), 
p. 74, ff. 
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28 
types of problems; nor is it pessible to attempt to make 
trained statisticians out of those who read this paper, but, 
rather it is intended, that this chapter shall be a non 
technical discussion of some of the more important and useful 
statistical techniques used in decision making, and the 
presentation of the highlights of some actual problems and 
the method used in their solution. 

It is assumed that the reader has had some training in 
statistics and is familiar with the measures of central 
tendency (mean, median and mode), frequency distributions, 
measures of variation (the range, inter quartile range, mean 
deviation) and the most important of all measures of variation 
the standard deviation, and correlation and regression. If 
thie is not the case I refer you to the text booke listed in 
the bibliogrephy and to Appendix B of this paper which contains 
a sample problem using simplified numbers to demonstrate the 
method of obtaining some of these various measures.  Correla- 
tion and regression is taken up in detail in the latter part 
of this chapter. 

Probability 

To provide a proper background for the discussion of 
statistical method it is weil to start with the theory of 
probability since statistical methods are based on this theory. 
The theory of probability is aptly described by the French 
mathematician Emile Borel as follows: 

The theory of probability is of interest to 


artillerymen; it is of interest, also, not only to 
card and dice players, who are its godfathers, but 
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to all men of action, heads of industries or heads 

of Armies, whose success depends on decisions which 

in turn depend on two sorte of factors, the one 

known or calculable, the other uncertain and 

problematic; ít is of interest to the statistician, 

the sociologist, the philosopher. 2 

In lay language probability may be defined in its 
subjective sense as a state of mind as to the degree of 
certainty or uncertainty. <A second interpretation is that 
which usually happens and may be defined as the relative fre- 
quency with which an event occurs in a certain class of events. 
The third interpretation of probability holds that probability 
refere not to events but to vropositions and has nothing to do 
with intensity or belief or the statistical frequency. 
All human affairs rest upon probabilities, and the 

same thing is true everywhere. lf man were immortal, 

he could be perfectly sure of seeing the day when 

everything in which he had trusted should betray his 

trust and in short of coming eventually to hopeless 

misery he would break Gown st last as every good 

fortune, as every dynasty, as every BULL Fe n 

does. In place of this we heve death. . . . 
With improved accuracy of measurement, it has been determined 
that no two things are exactly alike. In the case of & manu- 
facturer of uniform items there exists a family of chance 
causes of variations, such £e tool wear, fluctuation in voltage, 
variations in operator performance, etc. When all of the 
controllable factors are in € state of control, these chance 
causes which are not controlled vary according to a definite 


symmetrical pattern known as the normal frequency distribution 
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Ibid., p. 67. 








+ نږ‎ Sete ot a rin — وہ‎ a 
jè په هم 0ه وښ‎ aaa وسم لس دیما‎ 8 
po yl mess! sp À VIDA > AN 
Aname gar as sm Und cd سا حير‎ OEE Om سی‎ 
ہشقضتہ مہہ یر‎ e او‎ maim چپ جمد‎ PUP JRO: unen 

A e 
و مر مر ےد یم‎ 2 cen ot a 
ct, 3۳۹ ۸۵0۱8۵۰ ۰ set e JAM سم‎ 











غه م د تابتع ناه وو تب وب ہووے 
a T‏ بس rl‏ دولا نواه Ye achat‏ وس 
oe A LAA ۷٧١ seams‏ ضہ!إ ضف iru‏ ای 4 
1 ہما سن نوہ وہس نود مان یب الاين ور یو ېه 
ae ٣‏ × برم ہہ جو a: suey‏ 
e‏ دد نس ree‏ ملاس qe cm‏ عدر Rund‏ رذ mf Vuk a‏ 
anms‏ ه«اسه لا سم ال E SAL‏ 


اوس" 
xe Ro |‏ ~ 














30 


curve, figure میں‎ 





The abscissa or the peak of the curve represents the 
central tendency or average of all measurements with the 
number of observations or measurements varying in accord with 
the curve to the left and right of the center line. It can be 
noted that the number of observations or measurements which 
are farthest from the average are very few in number, but 
they still exist. 


Once the mathematicians had determined that a 
system or crocess undisturbed and subject only to 
chance variations, could be arrayed in a frequency 
distribution curve this information was put to work 
practically by further developing a mathematical 
theory about the characteristics of the voints 
“ scattered’ about the average or central tendency 
of the diezram, Tne ‘scatter’ points are independent 
of the average central tendency. A characteristic 
of the frequency distribution curve will be seen in 
following the curve from the left-hand side upward where 
it is noticed that the direction of this curve changes 
from concave to convex and passes through a flat point 
a little over half way towards the peak. Following the 
curve up over the peak and back down it is seen that 
this tendency is changed to where the curve goes from 
convexity to concavity after passing through another 
flat point. The flat point can be determined geometrically 
and is a characteristic cf each ideal frequency distribu- 
tion. The distance between thie flat point and the mean 
is the same on either side of the central tendency line 
and has been arbitrarily called a "standard deviation" 
designated in mathematical terminology by the Greek 
letter sigme (e). 


It has been determined that the number of parts 

occurring in the area bounded by sigma on each side of 

the middle line corresponds to 68.3% in an ideal frequency 
curve, It has also been determined that the number of 
parts or measurements which occur within two sigma limits, 
that is, twice the standard deviation distance on either 
side of the central average, amounts to 95.5 percent of 
the occurances or parts. The three-sigma limit reaches 
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out into the asymptotic zone and accounts for 99.73 

percent of the parts, aná this three-sigrs lirit is 

often edopted as an economical and practical basis or 

limit for commercial use of statistical control. These 

limits are such that an ideal process operating under 

the influence of chance variability will produce all 

production items within these percentage limits. In 

mathematical language this is referred to as “plus or 

minus three stanitard deviations” or “three sigma 

limit. "6 

In moet cases of industrial or scientific decision 
making it is generally accepted that a probability of 95 per 
cent (two sigma) ie the minimum limit for a good decision. To 
hold out for 99 ver cent confidence would in most ceses involve 
considerable expense. When working at the 95% confidence 
level, it is most probable that a correct decision is made at 
least19 times out of 20. 
Sampling 
Everyone who has considerea business problems knows 

that data on many situations are sometimes required under 
circumstances where it is net economically feasible or even 
possible to take measurements, enumerate, or test the entire 
universe or population under consideration. Very often, 
however, a sample may be of great assistance, and more often 
than not a sample will provice iore accurate results than the 
complete data. Although a complete enumeration will evoid 
making sampling errors, overall accuracy can be increased by 
conducting a sample survey rather than & complete count. For 


example the smaller scope of the sample project makes it 


possible to be more selective in assignment of personnel for 





ON ور سر :سس سس‎ TR ٢ 





Ord, De TO, IP. 





ته م٥ lafdd we) ۷۷ {othe‏ ہہ oF‏ و وہر 








۲٢ a موس همیس اف د د از لب‎ *) 9) nacen 
wit blew! sow. a «PT ده شم کوور دلوا‎ (pepe bar) da 
| c A0» 4; DR EA مب‎ me F2 a e A 
sex s OX smt ویو جس‎ sr iria 
pem x sm see Teme à Ium! snc free ed وهال غلا‎ 
09: 
لت‎ 

— DUM لف وی‎ Msi ۱ 
ult سنا‎ ahe ° mu LES coe چس‎ tard rud 
iw ys UA IDA MOS EE NM یم سس‎ 
^u» Xem feet Sm می‎ fear oF مس ال‎ 
¡rra ep oj nas lema des de dar se «owed haw 
Jo mes سو‎ o dote laa T4 د ما سس سم‎ runa 
سو سس وداج عرس‎ jja [ida مسا‎ 4 e d 
rwv. lis einen af LA. تسا‎ AA ۲۰٢ 
bem ماو‎ paad di orar ت‫‎ TA 
Pete سيا ات ات‎ 
IE سو ونو که‎ Y4 CoO 4S اس‎ ۷-٧ 
مسا‎ C fmit عاونا ب‎ mem or at اه‎ 


Tog o "نين‎ 


J 
b 























55 

the survey operation; to be more thorough in their training; 
and to be able to concentrate to a much greater degree on 
reduction of sampling errors. The net effect of a sample 
survey, as compared to the 100% count, is often in addition to 
being more accurate, is achieved with fewer personnel, less 
paper work, at a lower cost, and in a shorter time." 

It is well at this point to distinguish between a 
population (or universe) and a sample. A population is the 
entire set or group about which we desire some type of informa- 
tion, while a sample is a subset or a part of the population. 
In the population there are certain characteristics which we 
may estimate or which we may desire to observe and control. 
These characteristics are called parameters. The corresponding 
characteristics in a sample are called statistics. 

Sampling is the technique of drawing a few items from 
& universe or mass in such a way as to secure statistics on the 
basis of which we may, by inference, construct a picture of 
the mass from which the items come. The technique of sampling 
is used because frequently it is the only possible, or practical, 
and usually the most efficient means of studying mass data. 
A situation in which sampling is the only possible method 
would occur when materials are tested for quality, the object 
may be destroyed as would be the case in testing bricks or 


steel parts, for example, for tensil strength or for resistence 





Bu. J. Slonim, "Sampling in a Nutshell." Directorate 
of Statistical Services, DCS/ Comptroller, Headquarters, 
U.B.AcT., p. 16, ff. 
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to crushing forces. Sampling would be the only practical 
method for example when the statistical universe is large and 
composed of thousands or even millions of items. To enumerate 
the entire population would be impractical from & time and 
cost point of view. Even though in some case it would be 
possible from a physical or financial standpoint to make a 
complete count, as was pointed out above, it is not necessary 
to do so. A sample that is properly taken will provide tne 
necessary data from which generalizations about the population 
can be made, and far more efficiently. Briefly let us examine 
a few of the sampling techniques that can be used for any of 
the problems the statistically minded comptroller may encounter. 
Simple Random Sampling 

Simple random sampling is simply a sampling technique 
in which chance alone dictates which elements of the popula- 
tion will be members of the sample. All subjectiveness and 
human bias have supposedly been removed. In a random sample 
each element of the universe or population has an equal chance 
of being selected. If, under these conditions a sufficiently 
large number of items is collected, the sample will be a 
miniature cross section of the mass from which the items come. 
Random samples are not haphazard selections, but rather the 
system of selection is controlled and is not a hit or miss 
selection.” Tables of random digits have been prepared to 


assist in obtaining a truly random sample from & population. 





3 illiem Addison Neiswanger, Elementary Statistical 
Methods (New York: Macmillan Company, 1951), p. 96. 
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stratified Sampling 


There are particular situations when it ia desirable 
to have all elements of the population represent. For example, 
when making a study of inventory items, a simple rancom sample 
may exclude the more costly items at the extreme range of the 
population from being selected when it is desirable or 
necessary to have some information about these costly items. 
Stratified samvling is a scheme for accomplisning this tas 
while at the same time retaining the desirable property of a 
random sample. 

To accomplish this the population is divided into 
homogeneous groups, referred to as strata--each element in the 
group is similar to the other elements within the group. For 
example in the case of inventory items we may establish groups 
or strata of (1) all items valued at less than $5.00 (2) items 
valued from $5.00 to $25.00 (3) items valued from $25.00 to 
$99.00 and (4) all items that exceed $100.00. Now that the 
atrata or groups hsve been selected or established we sample 
randomly from within each strats, 1° 


Cluster Sampling 





There are on occasion situations in which the ease of 
sampling and the cost are important, and we are unable to 
select a simple random or stratified sample from a given 


population. In this situation we mey divide the population 





lOchester H. McCall, "A Statistics Manual." Unpublished 
statistics manual prepared for the Navy Graduate Comptroller- 
ship Program, Department of Statistics, The George Washington 
University, Washington, D. C., 1958, p. 34, ff. 
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into clusters which are, in fact, miniatures populations, 
and then to select at random, several clusters, cr a single 
cluster, from the population of clusters. From these clusters 
we may then draw inferences abcut the whole population. In 
order for this technique to be successfully applied the 
clusters must be similar to one another in their characterist- 
ics. For example, if we assume that ell cestroyers should 
have crews with similar attitudefregerding retirement and 
fringe benefits, then the selection of a single crew, or 
several crews would suffice to draw some inference about the 
attitudes of such crews in general; however, should the snip 
commander strongly influence the attitudes of his men, then in 
all probability such a cluster process would not reflect the 
over-all set of attitudes. It can be seen that coste and 
administrative problems would be reduced if cluster sampling 
can be used of one or several crews rather than a random sample 
of all such crews in the Navy. 
Systematic Sampling 

Occasionally the population from which a sample is to 
be taken is arrayed in a numerical sequence; for example, a 
series of vouchers or serialized cancelled checks. In this 
case we can use the technique of systematic sampling rather 
than simple random. Let us suppose for example that we have a 
population and that it is desired to select a random sample of 
200. A Pandom digit may be selected between 000 and 0049 to 
indicate the starting voucher number. From this point on, 
every 50th voucher is selected in a systematic manner. The 


sample of 200 then constitutes a simple random sample under 
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these conditions. We must use caution, however, and be certain 
that the orderings of the population is in no way dependent 
upon the characteristic examined. For example if every 50th 
voucher were associated with a particular company, it would 
be impossible to draw any zeneral inference about 211 of the 
vouchers in the population. 
Sequential Sampling 

Sequential sampling is a technioue develoned during 
World War II by Dr. Abreham Wald of the Statistical Research 
Groun at Columbia University. The philosophy of this technique 
is that often decisions can be made before the entire sample 
has been examined. Ths technique involved in this type of 
sampling is that observations are selected one at a tine, 
with a decision being made at each step, based on the cumulative 
information after the selection of each observation. Three 
decisions present themselves (1) that there is rot sufficient 
information to make a decision and an additional observation 
is necessary; (2) the data are inconsistent with the hypothesis 
being tested and hence it must he rejected; or (3) the data 
are consistent with the hypotheses being tested and hence it 
may be accepted. In other words a decidedly defective popu- 
lation will be apparent after the examination of an extremely 
small pert of a sample, thus it becomes only necessary to 
sample until our hypotheses is either accepted or rejected. 
By tne use Of this technique it is possible to draw a conclusion 
with a sample size of approximately 50% of that used for simple 


random sample. 
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Sampling ang Non Sampling Errors 

Whenever the technique of sampling is used there is 
inherent in the process the possibility of encountering sampl- 
ing errors and sampling biases, however, whenever sampling is 
performed so that every unit in the statistical univerce has 
a chance of being selected and the probability of selection is 
known, the errors of sampling can be controlled satisfactorily. 
This is known as probability sampling, 

Some of the important types of sampling errors are 
encountered when conducting a survey. The average lady over 
forty reports her age as under forty, a bank teller states 
his occupation as a banker and significant data may be 
eliminated from report forms. There are the errors of the non 
response type specifically a household survey conducted during 
the daytime would yield a poor estimate of lady truck drivers, 
an evening follow up of the not at homes would be necessary to 
overcome this bias. Errors occur in sample selection such as 
chunk samples conducted for convenience at Joe's bar or the 
ball park. 

The result of a sample survey will hardly ever agree 
completely with that of a complete enumeration. The difference 
between the two is known as the sampling error. If a probabi- 
lity sample design is used, it is possible to predetermine the 
8176 of sample needed to obtain a specified degree of precision. 

Sampling is used in many ways throughout the Department 
of Defense the results providing important decision making data. 
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For example sample surveys are conducted of military personnel 
to learn certain characteristics about them and to find out 
their opinions and intentions on a wide variety of subjects. 
This information is used for guidance in formulating personnel 
policies and actions that have considerable impact on virtually 
all defense activities. 

Recently a sample survey was conducted of Air Force 
clothing store sales. On the basis of pre-test data it was 
determined that a sample of 35 out of more than 200 stores 
world-wide would yield acceptable reliable clothing size 
distribution. Sales stores were stratified by size of store 
within climatic regions and samples were selected from each 
regional listing. The results of the survey yielded accurate 
size distributions for principal clothing items; the distribu- 
tion of sales made by climatic region, of sales by item, and 
by size of store. Such information as this helps to achieve 
improved purchasing and stockage procedures; smaller inven- 
tories, higher inventory turnover; and better service. 

By way of summary it can be stated that the appropriate 
method of statistical sampling is a function of the data 
desired, the form and availability of the population, the 
administrative requirements and costs associated with sampling. 
To determine the method of sampling is the problem of the 
sampler. 

statistical Inference 
In this section we shall discuss a few of the more 


important techniques of statistical inference and illustrate 
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some of the various techniques with actual case problems. 
Before beginning a discussion of statistical inference it is 
firet important that the term be understood. By inference we 
mean the process of generalizing from a bit or segment of 
information to the state of affairs in the population from 
which the segment of information has been obtained. It is a 
process of inductive reasoning or reasoning from the specific 
to the general. Stated in another way statistical inference 
is reasoning from the statistic and applying it to the popula- 
tion--or generalizing from a set of sample results to the 
population. This is a phase of deductive logic. 

There are two basic problems of statistical inference; 
these are (1) problems of estimation, and (2) testing a 
hypothesis. For example we may select a sample from a popula- 
tion for the purpose of estimating some parameter or character- 
istic in the population such as to estimate the proportion of 
inventory which is valued at less than $20,000. This is known 
as estimation. Or we may have the problem of observing 
whether specifications are being met where items are being 
manufactured and must meet a specific weight or tolerance. 

This is referred to as testing a hypothesis. 

There are several methods for obtaining solutions to 
the two basic problems of inference and will be discussed in 
the following paragraphs. The methods to be discussed are as 
follows: Estimation by (1) the arithmetic mean, (2) proportion, 
(3) correlation coefficient and (4) regression. The techniques 
for testing a hypothesis are (1) the arithmetic mean 
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(2) proportion and (3) correlation coefficient. 
The following table indicates the symbols for the 


population parameters and corresponding sample statistics: 





Characteristic Parameters statistics 
Arithmetic mean كار‎ (Mu) X 
Standard deviation Ó (sigma) S 
Proportion IT (pi) p 
Correlation Coefficient /O (rho) r 


Estimation 


Estimate the True Proportion 
In the following problem it is desired to estimate the 


true proportion of a population possessing a particular 
characteristic. Briefly the problem was to conduct an audit 
and estimate the number of pay records that contain monetary 
errors, 11 These records are maintained in the officers accounts 
office of the Department of the Navy, Washington, D. C. For 
the purpose of this problem a monetary error is defined as the 
omission or commission of an act which is either definitely 
prescribed or prohibited by departmental regulations. In this 
particular problem it was necessary to determine the correct 
sample size; to determine the percentage of erroneous pay 
records in the sample; and to estimate the number of erroneous 


pay records in the universe by statistical inference. 





ligeorge D. Tracy, Lt. Cdr. USN, "A Pay Record Audit 
Problem." Unpublished term paper, Department of Statistics, 
George Washington University, 1958. 
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Because we are dealing with pay records, and pay 
record entries govern entitlement to money, we want to insure 
that the estimate of the error rate in the universe is correct 
with a 99% degree of confidence. 
The following formula is used in the solution of the 


problem. 2 


2۳ MT (l- 7 )N 


2 2 


(N-1) ES +8 77 (1- 77) 


Where: 


Z = 2.58 For 99% confidence limits. Obtained from table I 
appendix A. 

T= .O1 An assumption of the approximate error rate based 
on a previous NRAO AUDIT. 

E = ,.01 (Epsilon) The amount of error that can be 
tolerated in the statistic "p". 

N 5,920 The number of pay records in the universe under 

study. 


ti 


By substitution of this data in the formula we are able to 


determine the value for n the sample size. 








n 2 (24.58)? x (,01) x (1-,01) x 20 
(5,920-1) x (.01)? 4 (2.58)? x (.01) x (1-.01) 
2 = 
مک د د‎ 97( A 20 7 414 
í 6 
n = 627 


Now that the sample size of 627 has been determined 
the next step is to prepare an alphabetical listing of the 
names of the officers, for whom pay records are maintained, by 
running a "money list" using the addressograph plates kept on 


flle for that purpose. Following the preparation of the money 
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list a starting point should be selected at random and this 
point checked. Additional names would be checked at an 
interval of ten names until the 627 names were all checked off. 
The next step of course is to go to the pay record files and 
pull the pay records of the 627 names that have been checked. 
The required sample would be comprised of this group of pay 
records. 

The statistical conclusion that can be made from this 
sampling process is as follows: Let us assume that the sample 
contained 12 pay records that had monetary errors. By referring 
to table 2, appendix A, for 99% confidence limits; where "yp" 
is .02 and the sample size is 627 the table shows that the 
proportion of the population ( JT ) lies within the range of 
.01 and .035 which is written .01</7 7.035. This means that 
with 99% confidence I can be sure that the percentage of 
monetary errors in the universe of 5,920 pay records lies 
between 1% and 3.5%. 

There are other approaches to this type of problem. 
For example if the sample size is large then the following 
formula yields a fairly accurate set of confidence limits: 

p- Z Vp )1-5(/5 > [[ > م‎ + 2 V» (1-p)/n 
Let us consider the following problem. A random sample of 
enlisted men indicated that 21% plan to reenlist when their 
present hitch is up. There were 400 men in the sample. The 
99% limits are: 
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¿21 - 2.576 Y .21 x .79/400 < Mx .21 = 2.576 V.21 0 


cee -.052 < 7 > EY + . 052 








.158 < T7 < 268 


From this we may conclude that the 99% confidence limits for 
the true proportion expecting to reenlist are from .158 to 


12 


.262, ^ or stated another way from 16% to 26% plan to reenlist 


Correlation Coefficient 
There are on occasion problems where two variables are: 
analyzed together, and a correlation coefficient is often 
computed. Because the sample correlation coefficient is 
subject to some variation it is often of interest to estimate 
the true population correlation coefficient. 
There are precise methods for calculating the desired | 
confidence limits, however, there are tables that can be more | 
easily used to obtain the 95% and 99% confidence limits for 
the true correlation coefficient. This of course is especiall) 
useful for the non mathematician. | 
For example a study was run to examine the relationshi| 
between personnel on board, and recurrent maintenance and 
operation costs for the Marine Corps Supply Depot, Albany, 
Georgia. 5 A sample of 16 yielded & correlation of .67. 





1 ecall, Of, Aid. p. 45. 


125. Frank Leis, Major USMC and Jim H. Bolton, 
Captain USMC. "Correlation of Manpower and Maintenance and || 
Operations Cost of Ten Major Marine Corps Posts and Stations. | 
Unpublished term paper, Devt. of Statistics, George Washington) 
University, 1958. 
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Consulting table IV appendix A the 95% confidence limits are 
estimated to be 
وو < هده‎ 7 

The interpretation of these results will be referred 
to later in our discussion of correlation and regression 
problems, along with estimation by regression techniques. 
Arithmetic Mean 

This statistical technique is used when it is desired 
to estimate the mean of a population, ۸) » using the mean 
of our sample X. If the standard deviation of our population 
is known the interval estimate for it is determined by the 
following formula: 


7 X +2 
وہ‎ AC < کے‎ 








The symbol Z is the normal deviate selected from table I to 
yield the desired degree of confidence. If 95% confidence is 
desired, Z = 1.96; if 99% confidence is needed Z = 2.576. 
For example let us assume a sample of flying hours was 
obtained yielding the following results: 
n (number of observations) = 100 
X (mean of the sample) = 1.8 hours 
the standard deviation of the population is known to be 
= .6 hours. For a 95% confidence level the interval 
estimate for (mu) the mean of the population becomes: 
1,8 - 1.96 x .6/ V100 < XX 1.8 + 1.96 x .6/ 6 
1.8 - 2 € (€ 1.8 + .12 
1.68 < ACK 1.92 
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The 95% confidence limits for the true average hours of flying 
time are 1.66 to 1.92 hours. 

Many times, however, A and ک‎ are not known. In 
this case the following formula is used X-t s/ ^/n-l 4. > 
X+t s/ Vn-1. In this formula s = standard deviation of the 
sample which can easily be computed and "t" is obtained from 
tables of selected Students "t" values.*” 

Testing a Hypotheses 

In practically everything we do in our daily routine 
of living we are compelled to make decisions. These decisions 
may be based on experience, impulse and/or scientific evidence. 
Although there is some diversity of basis for decision making, 
it is still possible to study and analyze the merits and dis- 
advantages of the various methods of making decisions by 
evaluating the risks which they force us to assume. A simple 
hypothesis might be of deciding which road to take, the right 
or the left, when arriving at a fork in the road or testing a 
pair of dice to see whether they are biased. The technique 
of testing a hypothesis is used frequently in the field of 
quality control. Here certain standards have been set up for 
production and periodic samples are selected to determine 
whether or not the items meet specification. <A hypothesis is, 
therefore, a statement which it is desired to reject with a 
specified degree of confidence, or to accept for lack of 


sufficient contrary evidence. +? 
14,, 
McCall, op. eit., p. 43. 


l5John E. Freund, Modern Elementary Statistics (New 
York: Prentice-Hall, 1952), p. 189, ff. 
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There are two types of errore that may be committed in 
experimental situations of testing a hypotheses. These are 
called the type one error and the type two error. The type I 
error occurs when the hypothesis being tested is true and we 
have rejected it. Type II occurs when tne hypothesis being 
tested is false and we accept it. 

Generally in testing a hypothesis the null hypothesis 
is formed and is represented by the symbol Ho. The null 
hypothesis 18 a hypothesis which is formulated for the express 
purpose of being rejected. It is much easier to prove things 
wrong than it is to prove things richt. If we were asked, for 
example, to prove that all sheep are white, it would require 
investigation of every single sheep to determine the color. 

On the other hand it would take only one black sheep to show 
that the hypotheses is false. Hence it is much easier to 
calculate the error which may be committed by rejecting a 
hypotheses than it is to calculate the error whicn may be 
committed by accepting it. 

Correlation Coefficient 

When a sample correlation coefficient has been computed 
an interpretation is sometimes desired. In the correlation of 


16 mentioned 


manpower and maintenance and operations costs, 
earlier, the null hypothesis, that there was no significant 
correlation between the two variables in the population of 

personnel on board and maintenance and overations costs was 


formed, 





6 
1 Leis and Bolton, op. cit. 


*' 
وم‎ + emm c (am Tad recs د +٭جج- ۵و جح‎ 
e ms rl بنضاغی د‎ YA AO 710174 
ee AN m 
priri AET pak اش ارس ۰ 0۲۲ تیا مسب سم‎ 
س همه يل لیو‎ ge AA a Mat 
Jas وم‎ SS — A 

















mas os Umm «Hom غو د دوس‎ pn 


A de ۳ < | 
ena"? مہہ‎ ri Tibe له مې‎ ۸۶ Aem مره‎ m d 


"سو elf‏ ے3 ب ہم یس صتعوہ ين r T‏ سم سج ہ 
am Û‏ بر مس عمج جا وروس ہے :یہ ۰۱ مس سریے 
لاص ل ا ESI‏ نمیم وو له ١٨١‏ انس نر malin‏ 
qat om‏ لاس ها عن مود سو مسل جم HOR‏ بسروس عد سد 
NF‏ سب ae‏ ار زمر سل سه "asno‏ ين 
AA‏ ابا did weve‏ ہار نم ما یز اوہ وه ۱۰ ۷ 
OR m mae‏ ور ہو naim Teen edd mala‏ وو 
وود مس ٣۴‏ همو وتو عږ. 


gn 











men ہے‎ A A و‎ - 1 

س همان« تو memire‏ دورج لم somme KAF‏ 

ص ا جاو صصص wa e‏ یم۹۶ مممم رسیں 

sa ۳ ١‏ فاد سښیهمسساد ج ٣۳ oam mem]‏ ولورد 
TIS AAACN‏ 0 الم چات حمنداس نم غه موسنوواوه «د 

Pas ۳ Ch Dua‏ مم رجح caso olimme me‏ مع 








TET 








48 

To test this hypothesis we refer to table IV appendix 
A for ( (rho) = C and n= 16. The 95% confidence limits for 
v are estimated as -.49 < r < +.49 hence it is determined that 
if the calculated correlation coefficient lay outside this 
range we would have a significant correletion between the 
variables, with 95% confidence that a true hypothesis was not 
being rejected. The data for the problem produced an r = .69 
for the Marine Corps Supply Depot, Albany; r = .53 for Marine 
Corps Supply Forwarding Annex San Francisco; and r= .29 for a 
eonsolidetion of the ten major Marine Corps stations. Since 
the r = .29 for the ten major stations falls within the range 
of chance alone -.49 < r < + .49 the hyvothesis that there was 
no significant correlation was accepted. For the individual 
stations, however, the correlation coefficients lie outside the 
range of chance alone, hence tne nypothesis in these cases was 
rejected. Since there was no correlation for the ten major 
سدم‎ ul only a poor correlation for the others there was 
no need to continue the problem for the purpose of estimation 
by linear rerereseion techniques. Further study using regressio 
technioues is necessary to determine the relationship of the 
variables in the case of the other two stations. 

To illuetrete another case 1١۶.٤ us use the problem of 
where a random sample of 200 recruits were given two forns of 
a test. Form A has been in use for 15 years and took & hours 
to administer. A proposed revision, form B, took only two 
hours to administer. If the correlation between these two 


forms was found to be .45, is it safe to conclude at 95% level 
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that there is better than a chance relationship in the 
populationii? Using table IV, appendix A, for = 0 and 
n = 200, the 95% limits for r are estimated as: 
- .14 4 v > + ۸ 

Sinee the sample correlation of .45 exceeds the sample we can 
assume with a S5% confidence that the two forms bear some 
relationship to each other. Further study would be required 
to determine how much. 
Proportion 

Certain tynes of problems are adanted to testíino the 
hypothesis that the true rrorortion ( 77) is cquel to sone 
epecific value. For example an ordnance devot contained a 
stock pile of 40,000 proximity fuses from World War II. If 
possible it was desired to use these fuses in a more modern 
weapons system vrovided they could produce a specific degree 
of reliability. If this is possible it could provide a tre- 
mendous saving in our defense effort. Since the fuse was only 
one component out of many in this system a 95% reliability was 
desired. A sample was taken consisting of 400 fuses, 360 vere 
found acceptable at the 5% level, can the hypothesis thet the 
lot meets reguirements be accepted? The mathematics involved 
in this problem is as follows: 

DP = 360 the amount of the sampls found acceptable 
n = 400 the amount in the sample 
77 = .95 the decree of confidence desired 

p = 360/400 = .90 








lfMoCall, op. cit., D. 52, ff. 
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consulting table 3, appvendix A for n = 400 and JT = .95 
the limite for p are: 
AND C» c 81 

The sample proportion is less than .925 hence we reject the 
hypothesis and conclude with a better than 95% confidence that 
the lot does not meet the reliability requirements. 

A hypothesis may aiso be tested by examining the 
sample mean to see whether it is contained within the Limits. 
If the standard deviation of the population ( o ) is known the 


following formule is used. 





e - = 
- A X Pa dz. 
E" ۲۳۳ > > مر‎ ve 


Let's assume a manufacturing firm plans to establish controls, 
according to covernment regulations concerning a specified 
diameter of a voroduction item. The mean diameter should be 
21em with = .5em. Sample sizes of 16 are chosen periodically 
anû a decision is made regarcing the control process at each 
time. A sample mean of 21.3 has been obtained and the 95% 
level is consicered acceptable. Conclusions: 


l. The null Hypothesis that the process is in control 
is 


Ho 85) 4 Sir 


2. Sinee the amount of error tolerated is  -.05 tne 
appropriate value for Z is 1.96, table I. 


2. The required limits are: 
21-1.96 x .5/ Vi6< X< 21 + 1.96 x ۹6ھ‎ 
2l =- .245 < < 23 + 5 


X 
20.755 « X 4 21.245 
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4, X = 21.3 em. falls outside of the limits, hence 
we reject our hypothesis with better than 95% confidence and 
examine the process. 


When the g íis not known the following formula is used 
بت + ۸ 16 > ۷8-1 8 ۶ - يعر‎ 8/ Vn-1 


the value for "t" is obtained from Students table for values 
of "t", 
| Correlation and Regression 

Correlation analysis is a statistical process of 
discovering and measuring functional relations of two or more 
series of data in order to ascertain the extent to which the 
variables are correlated. In other words to determine to what 
extent they move together or oppositely. Stated another way 
correlation involves a study of the relationships existing 
between two or more variables, while regression may be defined 
as the prediction of one variable by means of one or more 
addition variables. 

To illustrate an approach to this technique when only 
two variables are involved would be to plot our observed data 
on graph paper using an X and Y axis for each of the variables, 
for example smoking versus lung cancer. If the observations 
were scattered as seen below in figure 2 we would assume that 


no correlation exists, 
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however, in figure 3 lung cancer increase with smoking, not 


& significant correlation however. o 





fig. 3 
The observations in figure 4 show a strong positive 


correlation, 





fig. 4 


while figure 5 describes a strong negative correlation. 
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For linear relationships, correlations which describe 
a straight line relationship between the two variables, a 
measure of the degree of this correlation has been developed 
by Karl Pearson. This measure is known as the Pearson product- 
moment correletion coefficient. This measures only the linear 
relationship between the variables and is written: 


[an A s A 





ads Coz x?][ و‎ ۶ 2 - ) z v»? | 
To illustrate the concept of correlation let us 

consider a typical problem and one that is of continuing 
interest to the comptroller from the standpoint of budget 
prediction and cost control. The particular problem under 
consideration was formulated for the expressed purpose of 
attempting to find a method for predicting the recurrent station 
Operations and maintenance costs for Marine Corps posts and 
stations. In this particular case the data did not produce 
significant results when all of the stations were consolidated 
indicating this approach was not a useful predictive device 
under this particular set of cireumstances, however, correla- 
tions were evident when the data for each individual post or 
station was computed and analyzed individually. For the 
purpose of illustrating this technique we shall use the data 
for the Marine Corps Supply Depot, Albany, Georgia. 18 The 
original data are omitted but the following sums were obtained 


from 16 observations. "X" represents personnel on board and 





15 eis and Bolton, op. cit. 
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"Y" represents recurrent operations and maintenance costs 
in dollars, 
ZX = 54,900 
FX? = 190,508,002 


2 
ZY = 61,959%"° 


2 107 
۶ ۲ = 241,323,927” 


o 
ZXY = 213,754,156 0 


( Z X)^ = 3,114,010,000 


e 10^ 
( ZY) = 3,828,917,601* 
x10* 
n= 16 
X = 3,431.25 


Y = 387,243.75 
Substitution of these data in the formula page 53 provides 


the following: 
2 


» 2 2 
|“ (213, 754,156" ( - 3,101,5h9,100%° | 


EMEN p یں سک ہک ار‎ Ce 
|26 (190,508,002)-3,114,010,000] | 16 (21, 323, 927° ,و‎ 838,917,681 [ 


r? = 7 


T 2671854 (this is the square root of r@) 


i 


The r* is called the coefficient of determination and is some- 
times used as a measure of agreement. It Elves the percentage 


of the variation in one variable which may be aceounted for by 


2 


the variation in the other variable. In this case rf is .45139, 
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This indicates that the variability in recurrent operations 
and maintenance costs, 45.12% may be accounted for by the 
variability in personnel on board. The remaining 54.88% is 
due to other factors. 

Sinee our purpose in this problem is to utilize one 
variable to predict the other it now becomes necessary to 
extend the correlation concept into regression. We shall 
consider only the simplest of relaticnships the straight line 
or linear relationship at this time. In the population, the 
true linear relationship is represented by the equation 

TR AR 6 X 
where A and مج‎ are constants, X being the Y intercept 
and 6 being the slope of the regression line. The regression 
line obtained from the sample, is Y = a + bX. a and b are the 
sample estimates for the parameters ےر‎ and 6 . To obtain 
a anû b the following formula is used: 


n ¿MY = EE 27 





d= 
i Ex a ( zx)? 
a=Y-bxX 
Substituting the data in the formula we have: 


2 2 
b= 16 (212,755,15610 ) - 2,401,549,100 9 


16 (190,508,002) - 3,114,010,000 


2 
ba 1.1 gee 
2,132,377 
b = 54,27454 
a = 387,243.75 - 54.27454 (3431.25) 
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387,243.75 - 186,229.52 


fo 
if 


& = 201,014.23 


bX‏ + & دا 


H 


Y - 201,014.23 + 54.2745X 
On the basis of these answers we can now predict the value for 
Y, the recurrent operations and maintenance cost, for a given 
value of X, the personnel on board. If we plot these date 
on a graph we can see that there íis a fair positive correla- 
tion, however, it is felt this correlation is not quite strong 
enough to be used with reliability in estimating budget 


requirements. 


(Figure 6 is on following page) 
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fig. 6 
Our next concern is to determine the standard error 
of the estimate. This is a measure or value of the regression 
line. Of course the better the correlation the closer the 
dots will be clustered about the regression line. This measure 
is actually an estimate of the standard deviation of the ob- 


served points about the regression line. The sample formula 
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for thie measure is as follovwe: 


| 2 2 
53 ء‎ e e سار اج‎ Where sy = y m z yy? 


n 
7 n 
AJ (1 "S 86,973.25 : 


1 با‎ 
Baa — sy z 21,323,927 9 -3, 838,117, 681” 
16 


NUCRT WNOCURCUNMS 4 
J eX EN 34,168,062.50 y^ سم‎ 86,973, 250 


This standard error is relatively small, being roughly 
2%. This result cannot be attributed to a strong correlation, 
but mainly to the relativeiy small renze of values of Y. The 
range of values is approximately 33%. 

Conclusion 

In conclusion of this section it is well to point out 
that the statistical technique were only a few of the many 
different applications of mathematical techniques that can be 
applied to meet the particular needs of the situation. The 
point that this author would like for the reader to carry away 
is that the mathematics involved is merely the substitution of 
numbers for letters and simple arithmetic. It is hoped that 
the examples presented to illustrate the statistical methods 
illustrate the variety of control type problems that can be 
solved by the techniques. In addition to the problems already 
presented, correlation and regression has been used to compare 
direct labor hours to product overhead in dollars for naval 
ship yards. Maintenance and operations costs have been studied 


with aircraft on board for air stations producing some 
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59 
interesting values for r. The saapling techniques discussed 
in the first part of this section have been used to develop a 
work sampling method fer naval shipyards. Correlations have 
been determined for such additional problems as a comparison 
of sick leave taken by various categories of civil service 
personnel, and a comparison cf M and O costs to plant property. 


In the field of work measurement these statistical techniques 


are the major tools. 


893 
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CHAPTER IV 
LINEAR PROGRAMING 


Within the everyday activities of executives and the 
constant requirement for their decisions there exists mathe- 
matical tools, other than those just discussed in chapter III, 
which he may use to assist him. In recent years mathematicians 
have worked out a number of new procedures which make it 
possible for management to solve a wide variety of important 
problems much faster, more easily, and more accurately than 
ever before. These procedures have been called "linear 
programing" and sometimes "mathematical programing." 

Mathematical programing is not just an improved 

way of getting certain jobs done. It is in every 
sense 8 new way. It is new in the sense that double 
entry booking was new in the Middle Ages, or that 
mechanization in the office was new earlier in this 
century, or that automation in the plant is new today. 
Because mathematical programing is so new, the gap 
between the scientist and the businessman--between 
the researcher and the user--has not yet been bridged. 
Mathematical programing has made the news, but few 
businessmen really understand how it can be of use in 
their own companies. 

Within the planning and programing activities of the 
Armed Services there are many problems which are concerned 


with the most efficient use of men and materiel. The solutions 





1 
Alexander Henderson and Robert Schlaifer, "Mathematic- 
al Programinz," Harvard Business Review (May June, 195A), p.75. 
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61 
to these probleme contribute much to successful operation 
and decision making within the various branches of the Armed 
Services. 

The development and application of methods for 
determining solutions to such programming problems is a matter 
of continuing research. With the cooperation of mathematic- 
jans, economists and other scientists, both in and out of the 
Armed Services, mathematical procedures have been devised that 
solve an important class of programing problems. 

Basic Principles 

Problems in the planning and programing area are 
characterized by limitations and constraints imposed upon the 
availabilities of the items under consideration. For example, 
the number of men and dollars available to operate a program 
may be limited. 

These problems are capable of producing many possible 
solutions. That is the constraints of such problems can be 
satisfied in many different ways. The selection of the most 
acceptable or desireable solution ordinarily calls for judgment 
and experience. 

What the mathematical methods of linear programing do 
is to reduce the whole procedure to a simple, definite routine. 
There is a rule for finding a program to start with; there is 
a rule for finding the successive changes that will increase 
the profits or lower the costs, and there is a rule for 
following through all the repercussions of each change. What 


is more, it is absolutely certain that if these rules are 
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62 
followed, they will lead to the best possible program; and 
it will be perfectly clear when the best possible program has 
been found. It is because the vrocedure follows definite 
rules that it can be taught to clerical personnel or handed 
over to automatic computers.“ In other words, by a systematic 
procedure, a single efficient solution is chosen that meets 
the conditions of the problem. A solution is efficient if it 
accomplishes a specified objective of the problem. The 
efficient solution may, according to the problem, minimize 
dollar eost or expenditure of a critical item, or maximize 
come measure of combat effectiveness. Such a solution that 
can solve both the constrainte of the problem and the given 
objective is en optimum solution. 

The comoutational methods of linear programing are 
very general and are applicable to a large variety of problems 
puch as the following: 

l. The most efficient routing (in terms of transporta- 
tion cost, or time, or requirements for use of ships or 
aircraft) of material from a group of supply or production 
points to a group of storage or user points. 

2. The efficient and economical use of scarce raw 
materials, or machines, to produce the maximum output of 
critical goods. 

3. The best assignment of output production of military 
items, such as planes, between competing demands of combat 


and training missions. 
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A. The minimum cost of contract awards to meet a 
given supply requirement from among a group of bidders quoting 
various prices, maxima and minima production for various 
items, and other quantity or price restrictions as to bids. 

Basic to all of the vroblems is the matter of limited 
recourses which must be shared among a number of competing 
demands, and as cen be seen from the above problems, these 
limited recourses may be manoower, or capital, or material, 
or equipment, or possibly a combination of all of these. 

Mathematical programing, of which linear programing 
is but one phase, has been termed a restatement of the central 
formal problem of economics-~-that of allocating scarce 
resources so as to maximize the attainment of some predetermine 
objective--in a form which is designed to be useful in making 
practical decisions in business and economic affairs. The 
accent here is on the word "practical." The theory and 
principles that have led to the development of the mathematic- 
al programing techniques coming increasingly into business and 
military use may well be far beyond the comprehension of the 
non-mathematical layman, but there is nothing theoretical about 
their applications; a glance at the four typical problems 
listed above should dispel any doubt on thet score. It is no 
longer premature to say that mathematical or linear programing 
has proved its worth. . . for finding optimum economic 
programs. . . . It is a preetical tool for the comptroller and 


&ll phases of business planning.° 
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ÎR. Dorfman, "Mathematical, or 'Linear' Programming," 
American Economic Review, XLIII (December, 1953), p. 797. 


O 


> 
& we. iam (ODA ê ime mam mn ue 
(ie — cj 1198 «۴ pum WM) Munt mm رو‎ 
وص‎ o one m d M وو‎ ) e 


ې 


EI 
- 






















,Aeqe —- te 1912244000509 w.. — - afia ب حصنا ونلا‎ 
تقو تو‎ 7 ‘iw se) 49 یہ ام‎ e TO ل٧ وت‎ 
phm Jr coo y hee Sete ed doc» rd 
اه‎ jewelries omi of سب بر د د مو ههو‎ 
سم فد حيس ؟‎ me i erben! s Cue "OAM سبرا‎ ٢ 

wut & Ife Ts cóBA peg v ئ۴ یو‎ wr “OR 

we levers Mae? si fae Do rd Tia! y ntm P 
quA a YY شى مي اسټې ٢٩م موه‎ y e um ۱ 

اجب- 1 es DC lor‏ نس نه 54 د١مم‏ ار amant‏ 
— - عه of‏ سا اسک ته لوغ اسم د وس joss‏ 
| سے ند و «a el Iei a a mut‏ بد ds‏ سد 
A ha‏ أن Tm dme‏ مح ره A Aaii‏ 

owe ve d‏ اس ما امد" qui all‏ سي 
ا زع با dlii A‏ 
ALIAS AAA ^‏ انکر PS" GRASS‏ سایس . = 
A E LL. ID)‏ غو ول عو سو به جوم او دد لوس 
o — omm cm dmn‏ ول اي سس ۰ 


1 


E 


(eT teclat eet eal ce paste 4 ieee هک‎ 
dr *I o9) .— Cau) cm (uem ,سز سس‎ Jdaow aro An 
امه ود رای‎ re ر مدا( ھا‎ ne E AAN 

bimde Wahii Wir Tor IS = 
+ bined a adj دال‎ SMV AD 4A وه د )لټ‎ 
n taadaa coa TÍ m 


ARE 0-۳ £ 
اب‎ P to 


a _ < یح‎ —  À— — سه‎ <<”) 





64 
Problems 

A simple example of an application of linear programing 
is probably the best manner in which to explain and demonstrate 
just what it can do. Of the many types of problems that lend 
themselves to the methods of linear programing for solution 
are a group termed transportation problems. These problems 
call for the shipping of a supply of an item from a group of 
depots to a group of receiving stations. Each depot has a 
limited amount of the item, while each station has a specific 
requirement for the item. Usually there are many routings 
which will supply each station with exactly the required amount 
of the item. The problem is to determine the routing which 
not only fulfills the requirements but also minimizes some 
measure of the program cost. For example, the objective might 
be to minimize one of the following: the total dollar cost, 
the total number of miles of the shipping schedule, or the 
total time the items are in transit.” 

A fictitious example of a routing problem follows. 
The objective in this case is to minimize the total ton-miles. 
Let us assume that Lockbourne AFB at Columbus, Ohio has been 
testing a large item of equipment, weighing a ton, for the 
B-47. It is now desired that this equipment be tried at other 
bases. Five each are required by March AFB at Riverside, 


California; Davis-Monthan AFB at Tucson, Arizona; and McConnell 


Ahe Application of Linear Programming Techniaues to 
Air Force Problems: A Non-technical Discussion (Washington, 
D. C.: Directorate of Management Analysis, DCS/Comptroller, 
Hdqts., USAF, 17 December 1954), p. 10, 
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AFB at Wichita, Kansas. Pinecastle AFB at Orlando, Florida, 
and MacDill AFB at Tampa, Florida, each need three. To supply 
these 21 items Lockbourne AFB at Columbus, Ohio can ship 8; 
Oklahoma City Depot has 8, and Warner-Robins AFB at Macon, 
Georgia has 5. The items of equipment are to be air lifted 
to their destinations. The problem ig to determine the routing 
which fulfills the requirements and minimizes the total ton 
miles. All the foregoing information together with the appro- 
ximate air distances is summarized in the following table: 


MacDillı March! Davis MeConnell|Pinecastle 


Monthan 

Items Re- 

Avail- quire 

Skis une 5 5 5 5 5 
Oklahoma 
City 8 938 1030 824 156 995 
Macon 5 346 1818 |1416 806 296 
Columbus 8 905 1795 0 716 854 


Distance in Miles 


A first attempt at a solution might go like this: Try 
to fill each requirement from the nearest source. For example, 
let McConnell get its 5 items and Davis-Monthan 3 of its 5 
items from Oklahoma City. The requirements for the Florida 
bases should be met, as far as possible, from Macon. Let 
Pinecastle get its 3 and MacDill 2 of its 3 items from Macon. 
The rest of the requirements must be met from Columbus. This 


shipping schedule has a total of 17,792 ton-miles and is 





2Ibid., p. ll, ff. 






= 
eV ۱ ۵‏ دص قحوممہ دیز نک در الع oder?‏ 
په AT, AI O A‏ وس نس e © em‏ 


سد تت سس 
اس ال حم یں سس 
umi LEE v‏ وم نت mal sera‏ 
Lorie Lom d od MI‏ که Clore‏ یک nmm Td E)‏ ې سر 





4p مہ‎ ou up cube melbulee e de د که .چم‎ 
Tene (O cd. fea edi Aot! ون بان وی ما‎ ٦۲۶۷ We 
Eurie FÊ mimo Dm» e]! !oct on صا(‎ AM 
JODIE LI i یا‎ cO EC OOS O 
int ماما ایا کے سس‎ A غمه‎ 
هده سن سه نه نل د وو سیا‎ ۷۹ 
Aah? mto arl hez o Dane جج 0۰۹( دمہ :سید‎ 











66 
illustrated in figure 7, page 67. 

There are many other possible routings. The minimum 
golution, obtained by the methods of Linear Programing, has 
a value of 16,864 ton-miles. The computational procedure start 
with any solution to the problem-~such as the one desired 
above--and in a systematic manner obtains better solutions 
until the optimum one has been determined. This Linear Program 
ing procedure guarantees that the ovtimum will be found. For 
this simple example, it is not difficult, given the number of 
ton-miles in the optimum solution, to discover the correspondin 
routing. Without this information, one would not, in general, 
be able to determine just which solution is optimum. This is 
especially true for larger problems where solutions by hand 
becomes impractical. 

If in attempting to take advantage of the shortest 
distance appearing in the problem by trying to fill all of 
MeConnell's needs from Oklahoma City--the resulting solution 
would be less efficient. This fact serves to emphasize that 
what at first appears to be a logical approach to problems of 
this nature is not necessarily the best one. 

Another simple example of a transportation type 
problem is as follows: Suppose that a regional freight car 
distributor must send empties from three divisions on which 
car surpluses are predicted, to five other divisions exactly 
equal to the total surpluses, though this is not a limitation 
for other than simplifying the example. Conditions can best 


be expressed and summarized in the following table: 
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Origin Destination Surpluses 
D1 D2 D3 D4 D5 
81 9 
S2 4 
53 8 
Shortages 3 5 4 6 3 21 


It should be noted at this point that the problems 
given have necessarily been extremely simple; so simple, in 
fact, as to be susceptible of solution by trial and error if 
not by inspection. But if the examples were too extensive, 
they would fail as instructive and readily comprehendable 
examples. The table merely states that Division Sl has a 
surplus of 9 cars; D2 a shortage of 5 cars, etc. The cost of 
shipping an empty car from a shipping point to a destination 
is tabled as follows: 


Origin Destination 
D1 De D3 D4 D5 
S1 10 20 5 9 10 
S2 2 10 a 50 6 
55 1 20 T 10 4 


The table indicates that it costs $10 to send an empty car 
from Division Sl to Dl; $30 to send a car from S2 to D4, etc. 
Now, while it is possible to obtain a feasible solution by 
inspection, it is not possible to know whether the solution 
selected is the least cost solution, or what greater cost, if 


any, the solution has than the least cost. Linear Programing 


offers a method not only of obtaining the least cost solution, 
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69 
but also of ensuring that it is in fact the least cost. 
Appendix "C" gives the solution to this problem. The minimum 
cost is $150; one might find it interesting to make up a 
feasible solution and cost to see how it compares with the 
$150 tinm? 

When one realizes that the same technique is capable 
of solving problems involving a table 300 x 500 as easily as 
the 3 x 5 table shown in the example, the advantages are 
obvious. The time required is of course greater, but the 
technique is identical, and with the aid of electronic 
computers the time required is a matter of hours or minutes. 

Military Use 

Within the Armed Services it is often desired to make 
an efficient deployment of combat aircraft to competing 
activities or missions given a production schedule. Those 
high priority missions of course have first claim on the avail- 
able aircraft. It is necessary that we provide the maximum 
amount of aircraft for combat, however, we want to simultaneous 
ly allocate enough aircraft to advanced flyinz schools to train 
crews to fly them. Another objective is to minimize the 
surplus of crews. Results to be computed on a world wide 
basis and then by area. These examples are typical of the 
type of problem that faces the military executive that can be 
solved by linear programing procedures. 


These problems of course are mainly of an operational 





operational Research in Distributing Empty 
Cars," Railway Age, CXXXIV (April 20, 1953), pp. 73-4. 
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TO 
nature, however, in the end analysis cost is involved and in 
this the comptroller becomes vitally interested. More speci- 
fically the comptroller would be interested in problems con- 
cerning contract awards. For example: Whenever it is desired 
to procure items from civilian sources, producers of the items 
must be invited to participate in the bidding for contracts. 
The individual manufacturer submits bids in which he states: 

l. The price per unit of article or articles. 

2. The maximum (and minimum, if any) quantity of 
each item that can be produced at the stated price, and 

5. Any other conditions he wishes to impose. The 
bid reflects the manufacturer's desire for profit, his guess 
about the other fellow's bid, and his own peculiar limitations. 

The comptroller ultimately having to pay for the 
procurement of specific quantities of related articles, must 
see that contracts are awarded in such a way that the total 
dollar cost to the government is at a minimum. 

In evaluating bids the comptroller's shop must add 
shipping and other related costs to each of the bidders' quoted 
prices. Similarly, any savings that could be effected by 
agreeing to certain conditions are subtracted, e.g. a discount 
may be allowed for payments made within a certain time. Once 
the contracts have been awarded, the procurement office must 
be ready to demonstrate--even to the satisfaction of the losing 
bidders=--that the total cost to the activity of all contracts 


was the least possible. 
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The Bureau of Ships, Department of the Navy, has 
applied Linear Programing to yet another problem, most closely 
related to the economics problem of the demand too great and 
the resources too few. Electronics equipments must be allocat- 
ed to fill 811 authorized allowance requirements aboard the 
ships and on shore stations of the Naval Establishment. 
Equipments are seldom available in sufficient quantities to 
meet all recuirements. In such cases, priority of ship types 
and projects is one of the controlling factors. The use of 
new programing techniques will give an optimum use factor to 
the equipments available, and hence to the ships they go on, 
than has formerly been possible. 

Limitations and Other Methods 

Limitetions of the technique of applying Linear 
Programing are two-fold in nature: mathematical and practical. 
The practical difficulties are those commonly associated with 
applying any pure science to business and financial problems. 
One finds it difficult to express in suitable measurable terms 
the objectives and controlling factors, or constraints; 
mathematical programing has no monopoly on this difficulty, and 
it is of course a difficulty associated with the particular 
operation under investigation and study, not with the technique 
being employed to investigate it. ‘Secondly, even when one is 
able to find suitable and measurable terms for the operation, 
it may well be exceedingly difficult to determine proper 
numerical values for the coefficients of the terms. Like any 


other problem, if a more precise answer is needed, more precise 
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data must be used, and that in turn entails a thorough anā 
detailed study of the problem. Finally, there is the computa- 
tional labor required to solve the problem, once it has been 
set up in measurable terms with proper coefficients. The 
very fact that mathematical programing techniques are being 
used to investigate a given operation is indication in itself 
that the problem is quite complex, with many variables and 
constraints, such that it does not lend itself to quick 
solution by other means, or by inspection. We must necessari- 
ly expect then in cases of application of the technique that 
computational labor may be immense, even for electronic 
clauclators; must accept that fact as the nature of the beast; 
and be thankful that there is a technique to help us where 
until recently there was nothing but educated guessing. 
"Thus the practical difficulties encountered in applying 
linear programing are in a sense an indication of the level 
of difficulty of the problems which linear programing can 
treat."! 

There are many methods used in solving problems by 
linear or mathematical programing methods. ‘Some types of 
prodlems lend themselves to specific forms of solution better 
than they do to others. At the present time theoretical work 
in methods of linear programing is ahead of the application of 
these techniques, in other words, more is presently known 


about the subject than is put into actual practice. 





"osu 0. Harrison, Jr., “Linear Programming and 
Operations Research," Operations Research for Management, ed. 
Joseph F. MeCloskey and Florence N. Trefethan (Baltimore: 

The Johns Hopkins Press, 1954), p. 237. 




























sv 

hos جم ده‎ a aLibeme c»: ہو‎ dete Ae AA AAA ۷ 
ننس "سد ون ليجو اا — سس‎ A A a ولک‎ 
cow e ¿Las ¿nda uM ۵ 9 —— mon! Lm LE 
--— aos bs PYYA oam ote cmt. addo سی‎ ad 


chet me ca desc! Ls, Ei Sl »می‎ 





یا غه فوخ .بر موسو:ے رد خسار و بر ai‏ دیا 
الس ma m mm too d setiao‏ مو مرت وا 
m‏ سنا مغ (aem op qe Zoe E noe ccm dI delà‏ 
. 7 ور —À qe me imo 0 f d bmi‏ 
محر رھ کر یھ وک هص |اویاف ét mit Ve‏ س 
الأو« ع دهم صن --.. روس نے وس ان مو سورس 

ram ہتسویم وو فش‎ Gh oGm P4 AO A ص١١‎ ١ 
اقمع رو ےر دی وہ تد عرہ س ہ۔۔‎ eed) Linea) © امغر‎ 
pisima (mam Hha gatia مهس سم‎ iA 
dmn wb t Late Ieebévarve d 





1 





= 


DA‏ فص لد ه «amp‏ چا ايحي 
(ens 1 ai” mn rr TH. mm‏ سم مس ۰ می sn»‏ 
Te‏ 

E E سم‎ 


ID 


T^ ami oem ¿IA AAN O AO جج‎ "e 
ar tio! راس اس‎ Ve AUN هم کم ۸ هجهسصاانه‎ m. vent Lo. 
sve Pad ee sd 
e wut لغ‎ Cs QNA e Autem "T وس‎ Temi Ve dicem 

wm ilte am omo سے‎ yes هو‎ "d يا‎ 

ا سا سح و حا cde!‏ مو سا وسم 











"3 

It is impossible within the scope of this paper to 
discuss in detail ail of the presently existing methods. The 
following are a few of the procedures used and a selected 
bibliography for reference: 
| 1, Transportation Procedure? 
2. The Simplex Method” 
3. Profit-Preference Procedurel? 
4, The Solution by a System of Ordinary Differential 

Equations! 

5. The Dual Methoat® 

Perhaps the best known method is the Simplex Method. 
This method requires that a mathematical solution is feasible 
at the start, however, it need not necessarily be a practical 
solution from a business point of view. A sequence of feasible 


solutions are computed; each one is computed from its 


predecessor in a manner such that the best answer is gradually 





Suenderson and Schlaifer, p. 95. 


7thid., p. 100. 


T Charnes, W. W. Cooper, D. Farr, "Linear Programming 


and Profit-Preference Scheduling for 5 Manufacturing Firm," 
rations Research Society 9f America, I 
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Ge W. Brown and J, Von Neuman, "Solutions of Games 
by Differential Equations,” Contributions to the Theory of 
Games (Prineeton, N. J.: Princeton University Press, 1950) 
ed. by Harry V. Kuhn and A. W. Tucker, pp. 73-9. 


12 
C. F. Lemke, "The Dual Method of Solving the Linear 


Programming Problem," Naval Research Logistics Quarterl 
I (March, 1954), pp. 36-47. 
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approached and that a previous solution is never repented. 
There are a finite number of solutions between the range of 
the original one and the optimum. These operations are carried 
out by simple repetitive and tedious arithmetic steps. It is 
possible in the case of simple problems to use the hand method 
or desk calculators, however, for large problems electronic 
computers would be required. 

The easiest procedure and the one I prefer is the 
Transportation Problem Procedure. The name of this procedure 
is somewhat misleading and has the connotation that this 
procedure can only be used for transportation problems, 
however, there are a multitude of problems that can be solved 
by the procedure. It acquired this misleading term by being 
first applied to determine a lowest cost shipping program. 
Appendix "C" contains instructions for its use and provides 
step by step instructions for solving a problem concerning 
empty freight cars, 

The other methods listed are usually applied to 
specific types of problems for which they are better suited 
than the simplex method. By and large, however, experience 
has shown that the Simplex method is the most probable method 


to use. 


Summary 


In summary we can state briefly that Linear Programing 
is a mathematical method which can be applied to a variety of 
problems both civilian and military--and of particular use to 
the comptroller in those areas of cost; and the technicues of 
Linear Programing can be used to find efficient solutions to 


these problems. 
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CHAPTER V 


OPERATIONS RESEARCH 


There is a new concept in management that recently 
has become of vital interest to the comptroller. It is called 
operations research. The basic characteristic of operations 
research is the fact that the majority of the practitioners 
were trained in the basic sciences rather than in engineering 
or administration. Some other important characteristics are: 
that it is concerned with research on the operations of the 
whole organization; the optimization of operations in a menner 
that brings about greater eesurance of both short and long 
range health for the organization; the newest scientific 
methods and techniques are applied to the proble; synthesis 
and extension of the methods and technicues of the older 
management sciences; analytical models are developed and used 
in a manner common to the basic sciences; experimental opera- 
tions are designed and used in 58 manner that give an insight 
into the behavior of actual operations; and the use of inte- 
grated and creative multi-disciplinary team research to solve 


complex operational problems. t 





1 
Joseph F. MeCloskey aná Florence N. Trefethen (ed.), 


Operations Research for Management (Baltimore: Johns Hopkins 
Press, 1954), p. xiv. 
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Operations Research Defined 





By definition Operations Research is: 

The prediction and comparison of the values, 
effectivenesses, and costs of a set of proposed 
alternative courses of action involving man-machine 
systems. To do this, it uses a model of the action 
that has been developed analytically by a logical 
and when feasible, a mathematical methodology. The 
values of the basic action parameters are derived 
from historical analysis of past actions or from 
designed operational experiments. Most importantly, 
because all human and machine factors are meant to 
be included, an estimate of the uncertainty in the 
predicted outcome and in the values, offectivgnesses, 
and costs of the proposed action is provided.“ 

In defining and attempting to understand operations 
research it is often confused with statistics, especially as 
applied to the body of specific techniques based upon 
probability theory. The operations research analyst does, 
however, use statistical methods when apvlicable to finding 
the solution of the particular problem at hand. Because of 
the wide spread use of statistical technique and method used 
in the solution of many problems by the operational research 
analyst, I felt that a discussion of this technique was appro- 
priate in this paper dealing primarily with statistical method. 
Although the operational research analyst does use statistical 
techniques often he is not restricted to them. Statistics is 
concerned primarily with the relations between numbers, while 
operations research is concerned with reaching an understending 
of the operation--of the underlying physical system which the 
numbers represent. This could cause a significant difference 


in the results as well as the approach. For example in 5 
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recent advertising study, the operations research team found 
the key to characterizing the way in which advertising affected 
consumers in the results of a series of "split-run" teste; 
earlier results had been presumed useless efter statistical 
methods, such as analysis of variance and multiple regression, 
had failed to show meaningful conclusions. 

Operations research is also sometimes confused with 
accounting. This is particularly true when the control aspects 
of accounting are being considered, however, it should be 
noted that accounting is one of the principal sources of data 
to support an operations research study. Within the analysis 
and construction of measures of control and the functions of 
operations research there is some overlav as well as a mutuel 
interest on the part of the accountant and the operations 
research analysts. It has been said in comparing the two fields 
that the accountant has served a useful purpose in bringing 
the importance of control measures to the attention of business 
management, while operations research has shown ability in 
building new methods for developing and implementing these 
concepts of control. Other services such as marketing research, 
engineering, and the industrial engineering field are sometimes 
confused with operations research. Perhaps the most significant 
difference between operations research and these other services 
lies in the type of people employed. Operations research 
people are scientists, not experts. Their value is not in 
their knowledge or business experience but rather in their 


attitude and methodology. It should be noted also at this point 
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78 
that operations research is not in competition with the other 
services, but rather compliments the other services and 
frequently serves to integrate other information, and uses 
the expert opinion and factual data provided by other services 
in an organized and systematic analysis. A well organized 
operations research group should have available the services 
and counsel of experts in the other fielis for the most 
effective joint attack on management problems.” 
Types of Problems 

In chapter II of this paver it was noted that operation 
research had its beginning primarily in solving military 
problems of a tactical and strategic nature. It is well at 
this point to list some areas of application other than those 
of a strictly military nature. Today operations research is 
used to solve such diverse business problems as directing 
ealesmen to the right accounts at the right time, dividing 
the advertising budget in the most effective way, establishing 
equitable bonus systems, improving inventory and reordering 
policies, vlanning minimum-cost production schedules, and 
estimating the amount of clerical help needed for a new 
operation in order to accomplish these tasks. Operations 
research analysis uses the mental processes and the methodolo- 
gies which we have come to associate with the research work of 
the ohysicist, the chemist, and the blologist--what has come 
to be called “the scientific method." 





Žoyril C. Herrmann and. John F. Magee, "Operations 
Research for Management," Harverd Business Review (July- 
August, 1955), p. 108. NEN ll mle 2 
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The Scientific Method 

The basic premise underlying the scientific method 
is the belief that phenomena have a cause, and it is the 
scientist's contention that by hard work the mechanism or 
System underlying the observed facts can be discovered and 
used to the investigator's own best advantage. 

The scientific method calls for a special mental 
attitude, thet is a reverence for facts, facts that not only 
explain the result but those that exvlain the causes also. 
When the executive looks at sales figures, he sees them in 
terms of the success of the sales campaign and how it will 
affect profits, while on the other hand the scientist looks 
at these same figures and seeks a ciue to the fundamental 
behavior pattern of the customers. By the process of induc- 
tion he tentatively formulates a theoretical system or 
mechanism; then by an inverse process of induction hs can 
determine what phenomena shoulá take vlace. He then checks 
these against the observed facts. His test is simply a matter 
of determining if quantitative data is produced such as can 
be used for predicting how the customers will behave. For 
example: 

In a company manufacturing specialty products, 
examination of aecount records showed that customer 
behavior could be accurately described as a time- 
dependent Poisson process--a type of phenomenon found 
widely in nature, from problems in biology to nuclear 
physics. This concept yielded the key to establishing 
measures of the efficiency of the salesmen's work and 


of the effect of the promotion in building sales. 
On this basis a new method of directing promotional 
salesmen to appropriate accounts was constructed-- 
and then tested by careful experiments, to see if 
Sales increases resulted at less than proportionate 
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80 
increases in cost (the results in this case 
were spectacular: an overall sales rise in 
six Teure; and a corresponding gain in net 
profite). 
Basic Concepts 

There are four basic concepts important to the 
technique of operatione research. These are (1) the model, 

(2) the measure of effectiveness, (3) the necessity for 
decision, and (4) the role of experimentation. 

The first concept the model is a2a simplified representa- 
tion of an operation, containing only those aspects which are 
of primary importance to the problem under study. The model 
greatly facilitates the investigations of operations. The 
Operations research analysts uses e model in much the same 
manner as the aeronautical engineer uses a model airplane to 
investigate the aerodynamic properties in a wind tunnel; or 
the accountant who uses the accounting model, a simplified 
representation on paper, in the form of accounts and ledgers, 
of the flow of goods and services through a business enterprise, 
and thus measures the rate of flow, the values produced, and 
the performance achieved. Many modeis are also used in physics. 
These are sets of mathematical equations. 

Operations research models are either exact or 
probabilistic: (1) The exact model is used in cases where 
chance plays a small role and the effect of a given action is 
easily and reasonably determined; such as in long range 


production scheduling in the face of known demand, This type 
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of model is accurate enough so that except for some very 
unusual circumstance, over the long run planned and actual 
production will be reasonably close. (2) The probabilistic 
model explicitly recognizes uncertainty such as would be 

the case in advertising where unpredicability of the consumer 
plays a great role, The methods developed for physical 
problems involving mass behavior under random conditions can 
be applied easily and with great value to operations. 

The second basic concept, measure of effectiveness, 
is a measure of the extent that the operation is attaining 
its goal. One measure of effectiveness if the return on 
investment or the net dollar profit. 

The third concept is that of decision making, and the 
existence of alternative courses of action. The objective of 
operations research is to clarify the relation between the 
several courses of action, and to determine their outcomes, 
and to decide which course of action measures up best in terms 
of the company goal. 

The fourth concept concerns the role of experimentation. 
Operations research is essentially the application of science 
to the study of operations. The theory, or model, is built 
up from observed data or from experience. There are two kinds 
of experiments: the first is designed simply to get informa- 
tion, the other type is designed to test the validity of 
conclusions. 

The application of the basic concepts to management 


problems present some areas of difficulty. The first problem 






























n 
و سی سم‎ “e Ty su Fe : ء٦‎ de o « ل اسن ته لو‎ èl مه سن‎ 





————— qol 
cured اس ات اسان‎ ads ¡Mia! 97 "عا‎ 

o: Mus e ates Pulse tunm, 0 ۱ 
وہ‎ |OWM oo vlM iMt Cmm te petal Owe ب ےج‎ : 
مووود بده‎ Io Steere’ de کی‎ aún d © لآ‎ 
de. ALI یوب‎ ete د هه مو يمه. سا‎ ۰ 
ipse د۹‎ AZ api (ema dde má imi Siew 
ptm هم‎ D emm om AL ee ae ٠ 
٠٠٠یہ دہ‎ al ا موم م: اص مو لېا لسا نیت سے ہوم‎ 

-— 1 مه TR UE‏ وی 
۰٢۰ y oe mmm‏ رد ننف s IO‏ 
٨٢۸ Pas 07 a»‏ نصر % e‏ اسن طط ہیں 
toe p atay S0. mje Jo ma AA‏ 
چجلتلجه حسم حجود ١۷ 6) «J‏ وہ 7 سه دا مهم م0م انس 
١6 0‏ نلم ها má me nd‏ نی e‏ 
se ove o4 ID‏ لق I TETE M0 2 — D ueste dbor—5‏ 
کب کی وم سات پک ھا وج JTL.tJe X‏ 
AM riLab)lusKam aj Pera ٨‏ کرھز لھا او sonrisa Ya‏ 
حاۃہ Lbs Ab ado (oc (CO «D A n UD‏ 
md‏ كرد OP qo n‏ مور mu T‏ مو seule sa‏ 
omn m‏ سه su owl)‏ مدمیوسی ‏ رس A?‏ رهز tole‏ 
JE T‏ آم اد سا ۶ re‏ ت itet‏ عد 





sme amos ol (ne د‎ wU ومغ بنټ.‎ 
m. 4 m e أن‎ ETTA A ٩۰ وم هج مس‎ 





82 

is the choosing of the initial area of investigation, 
selecting the personnel to conduct the work and developing 
organizational plans for future growth. In commencing an 
operations research study or investigation there should be 
an opportunity for decision between alternative courses of 
action. The problem must be capable of quantitative measure- 
ment and study. The possibility for data collection must be 
present, and the possibility of evaluating results must be 
present. Management and this applies to comptrollers in the 
military have two alternatives in obtaining operations 
research people. The use of outside consultants may be 
resorted to or train members within the organization. Both 
alternatives have their advantages and disadvantages. Some- 
times a compromise is effective; that is, having an outside 
group assist in getting an internal organization started. 

Operations research should only be undertaken with the 
intention of continuing and expanding its use. The investment 
in knowledge and methodology on the staff is too valuable a 
commodity to throw away. 

Evaluation 

Case histories have shown that the techniques of 
operations research provides a basis for objective analysis 
of operating problems. These studies are quantitative and 
provide the basis for sound estimates of requirements, objec- 
tives and goals, and a basis for planning and decision making. 
Through operations research there has been a technique for 


the application of organized thinking to data already existing 
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within the organization, and for the introduction of new 
concepts and new methods of analysis. 

However, we must make this observation. Operations 
research is not a cure-all for every management ill, nor is 
it a source of automatic decisions. It is limited to the 
study of tangible, measurable factors. The many factors 
affecting management decisions that are intangible and quali- 
tative must be evaluated on the basis of executive judgment 
and intuition. 

The fact that operations research is scientific in 
character rather than expert means that more time is normally 
required to arrive at any solution or conclusion than in the 
case of normal engineering analyses. 

In conclusion, the future of operations research 
appears bright. Successful applications in many fields of 
endeavor and especially industry are overcoming the skepticism 
of many people. The areas of application of the science is 
broad. The new and relatively untested theory of games hold 
promise for the development of strategic concepts. Operations 
research can and will live up to its expectations of helping 


executives to make decisions. 
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CHAPTER VI 
CONCLUSION 


In the preceding chapters we have discussed statistical 
method and its relation to the comptroller along with a brief 
discussion of the historical background concerning the develop- 
ment of mathematical procedures for decision making. The 
statistical methods and case studies were of necessity brief, 
but it is hoped that they served to illustrate the various 
methods of application in solving typical management problems. 
Perhaps some enthusiasm has been engendered for the science 
of mathematical decision making. <A word of caution is in 
order, however, these mathematical methods and especially 
those based on probability theory are only a whetstone for 
common sense in their applications to problems of the comptrol- 
ler and executive decision making. Administrative methods 
already in use within the organization are sharpened with this 
scientific tool which enable the user to locate and to make 
the most of significant factors that bear on tne problem under 
analysis. It is appropriate to emphasize that there is no 
substitute for knowledge, not necessarily knowledge derived 
from books or a college education, but, empirical knowledge 
of one's own business. For example, the small business man 
1s better able to predict his firm's sales for the coming 
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year than a college trained statistician who is unfamiliar 


with the firm or industry. The statistical formulas presented 
in this paper cannot evaluate nor recognize and give cognizance 
to the multitude of factors which bear on the future. This 
is not to say that statistical formulas are useless when a 
multitude of data are available. On the contrary, statistical 
methode are often the only means of determining some order 
out of unordered and unrelated facts. It, therefore, appears 
evident that an intelligent forecast based upon sound analysis 
of the best available data is far superior to a forecast based 
upon hunches and rumors. 

How Much Statistics Should the Comptroller 

Know? 

The comptroller if he is to have an adequate apprecia- 
tion of the use of statistical method in decision making 
should acquaint himself with the following broad categories 
of the statistical field, namely: 

l. Sources of Information 

2. Decision Making Tools 

5. Forecasting 

4. Administrative Controls 

5, Presentation of Data 
Sources of Information 

The comptroller, for an adequate understanding of 
this phase should: 

1. Understand fully the problem for which he 


seeks information. 
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2. Be able to determine what kinds of data are 
necessary for the utilization of decision processes. 

3. Be aware of the limitations of data obtsined 
within his own organization. 

4, Be familiar with listings of source material such 
as specialized governmental and private publications. 
Decision waking Tools 

The comptroller should develop an adequate vocabulary 
in the field of statistical decision theory in order that he 
may communicate more effectively with his statistical staff. 
He should understand the approoriateness cf the methods being 
employed in solving 2 particular problem. The comptroller 
should have some understanding of the formulas involved in the 
process, and more important, of the time involved in the 
application of these formulas. He should be familiar with 
modern computing techniques in order that he may know where 
to turn for help and guidance. 

Forecasting 

Planning is a fundamental problem cf the comptrcller, 
and planning involves projection into the future. All fore- 
casting involves some degree of uncertainty. With adequate 
statistical knowledge, the comptroller can reduce his "uncert- 
ain stabbing’ decisions and base each more on a sound 
scientific basis with established and predetermined risks. 

A good example of the problem encountered in fore- 
casting is given by Mark Twain when he summed uo the nonsense 


side of extrapolation in his book "Life on the Mississippi:" 
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In the space of one hundred and seventy six years 
the lower Mississippi has shortened itself two 
hundred and forty-two miles. That is an average of 
a trifle over one mile and a third per year. Therefore, 
any calm person, who is not blind or idiotic, can see 
that in the old oolitie Silurian period just a million 
years ago next November, the lower Mississippi river 
was upward of one million three hundred thousand miles 
long, and stuck out over the Gulf of Mexico like a 
fishing rod. And by the same token any person can see 
that seven hundred and forty-two years from now the 
lower Mississippi will be only a mile and three quarters 
long, and Cairo (Illinois) and New Orleans will have 
joined their streets together, and be plodding 
comfortably along under & single mayor end mutuel 
board of aldermen. There is something fascinating 
about science. One gets such wholesale returns of 
conjecture out of such a trifling investment of fact. 


Administrative Controls 

Concerning administrative controls it is a well known 
fact that these controle are of little value 1f they are not 
carried out. The comptroller must have sufficient knowledge 
of statistical control techniques to be able to interpret 
changes in control on an objective basis. He must have an 
understanding of the role played by probability in the control 
situation, and be able to evaluate departures from control 
and to take appropriate action when control is violated. 
Presentation of Data 

The best techniques in decision maxing, forecasting 
and control, are of little vaiue if the results are not 
adequately presented. The comptroller should select tabular 
and graphic devices which most effectively communicate the 
desired message. To illustrate this point an editorial writer 
of a financial magazine reproduced a chart from an advertise- 
ment in Washington, D. C. The argument stated Government 


Payrolis Up! Figures on the graph showed an increase from 


ree ۹4 ژ ہے‎ - t 
de nu t ra 


es a+ |] AÑ add a - 1 
ith يدم نم‎ rt ¡Lu » me elle « cw» od 
ہہ‎ A at ei’ سروف‎ a) ته‎ MAA 
"T" Eo ma Mi WAS LAS 2121 do «e eg oq اد‎ 
r&* leid dam "esf "jj (urea Pars com "نم‎ 
و مم وس‎ E م‎ u all pl me h hr 
سا هش‎ at t Lue ۴ ss ieee Vo» ولوس م‎ V. ee 
سس سے ې ځڅه‎ r= U ee می‎ ١ لے‎ 


X^ -AA / pa pu i om og‏ نی 
quu‏ ۸۵۵ ۱ ۸ل sar Vin «fje s it - tile‏ 
یټ A ec wem Omm (elon ۳٣ Ce De‏ 
J ;‏ په "1l" M AL. 23,4.) "sp ws‏ | سب 
اہ — e'A24!‏ 1 سے c Las © ; ۰ E ^. e e‏ ۱ 
یہ 1 nda ema s A - LM wm d 18 v ١ ١ autem‏ 

is 1. e Tp 2 de ليدم ! سم ؛‎ = PEA? =, TT ا‎ 59 
هر‎ Pru irn) وواد‎ + Amp TM Pe سپ‎ 


= 4 IL T 






























مه مې ١‏ 





۱١ ٧۳۳١٣۹ AFILIADA 27‏ ده د سن ما 

ice مه‎ Qe) Tt eure gom va ںیرد جک ےھ‎ Pow ii 

-alimi s du ۶‏ سید مم mme mae‏ سدم یہ 
هرکان ماد ded!‏ ته نت 8199 35 (ug‏ 

" ۱ Od fda | = a ږو‎ 

© igen dn i Y GAIA 


| alil m 





[EJ سی ف‎ 7" 8 
iren won vH 


y) cia e mm‏ ټراه ٨‏ وی ہب 


ہت te Was‏ رس هه" ہم af iye‏ وت 














سکہ 
L‏ 72 تاي غه e ¿ANT lim‏ 
spe WEisil YA wv»‏ ال «iii IM‏ می 
AMAR ALLA SD ۹ ole‏ جو Y‏ 


(someon 7لو‎ 6٣ دو۷-١‎ de hih "الله‎ 
‘Tie NL D N [14 wt 
ETT û Ov? FOS I hubby Ug Vm سه‎ 
کا‎ CeLAN i aT 56 LT KL * 


11 للك 4 و "a war‏ سر وا 


E 


88 
19.5 million to 20.2 million. By rescaling the graph an 


entirely different version was given by this magazine 


Goyernment Payrolls stable! 
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To summarize, it can be seen that the well versed 
Gecision making comptroller must have a vast fund of knowledge 
available to him. Much of this knowledge empirical from a 
particular field. The comptroller must have the knowledge at 
his finger tips or know where to get it. He need not be 
proficient in the deep, mystical, ani theoretical foundations 
of statistics; however, he neecs to know enough of the surface 
material not to be misled or confused. The comptroller must 
trust the foundations on which his decision-making processes 
are based, for then only will he have faith in the course 


of action he chooses to take. 
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VI APPENDIX 
A TABLES 
Table I 


Selected Normal Probabilities 


% Area 
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APPENDIX B 
Directions for Computing the Mean and Standard 


Deviation From a Frequency Distribution 
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X = The mid point between the numbers in the distribution. 
= The frequency or number times these observations occur 
in the distribution. 
fX = Frequency times the mid point. 
a = Deviations from the mean. 


fd = Frequency times the deviations from the mean. 


fac = a times fa. 
= = Means to sum or add up. 


To compute the mean: 


Mean = < څل‎ = 7815 
n 100 


X = 78.15 X = Mean. n = number of observations 
in the distribution 
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APPENDIX B (CONTINUED) 


To compute the Standard Deviation: 


S = Standard Deviation and i = the class interval of the 
distribution 60 to 65 to 70 etc. 


2 
I1 fa? '( fà 
n n 
۱ 2 
١7 GAL eX 
825 100 100 
s=5 \/ 1.81 - .0529 | S = 6.625 
s-5 V 500 7 
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APPENDIX C 
Directions for Solving Problems by the 


Transportation Problem Procedure 


This sample consists of assigning the manner of 
shipping railroad empties from three shipping points having 
excess number of empties to five destinations having, in 
total, the same number of freight car shortages. The data 
for the problem is given in Table A below; that is, the 
shipping rates from each shipping point to each destination, 
the excesses at each shipping point, and the shortages at 
each destination. 


Surplus 
(A) D1 02 D3 D4 05 Totals 
S1 $10 20 5 9 10 9 
52 2 10 8 30 6 4 
55 1 20 ^ 10 4 8 
Shortage 5 5 5 21 
Totals 


It is first necessary to get a feasible solution, that 
is, one which meets the fixed requirements (of excesses and 
shortages) regardless of cost. This solution may be obtained 
as follows. Take the 9 cars at shipping point Sl and fill 
the shortages of 3 cars at Dl and 5 at D2; put the remaining 
car at D3. Take the 4 cars at S2 and put 3 more at D3 to 
complete the shortage there of 4, and put the fourth car at 
D4. Take the 8 cars at S3, put 5 at D4 to complete the 
shortage there of 6 and put the remaining 3 at D5, which 
fills the shortage there of 3. The procedure could obviously 
be used to fill a table of any size. This solution is entirely 
feasible and would cost $351. 

(3 x $10 + 5 x $20 +1 x 55 + 3 x 28 +1 x $30 +5 x $10 + 3 
x $4 = $351) 
The table would then be shown in Table B. 


(B) D1 D2 D3 D4 D5 

81 5 5 1 9 

82 3 1 4 

95 D 3 8 
3 5 4 6 3 21 


Since we learned in Chapter 4 that the method we are 
using leads to the optimum solution by proceeding through a 
series of solutions, each closer to the optimum than its 
predecessor, and never repeating, it is obvious that we may 
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save ourselves a number Of computational steps if we obtain 

a first solution which is the most economical we can get by 
quick inspection. Stated differently, if we use common sense 
to get a starting point which is fairly inexpensive in terms 
of shipping costs, we will save ourselves a great deal of 
computational effort, even in a problem as short and simple 

ag the example we are using for instruction purposes.  There- 
fore let us attack the problem as follows. Take any ship- 
ping point at all and use the capacity (number of empties) 
there to fill those destination requirements (shortages) which 
by the cost of shipping it seems most economical to fill. 

When that capacity is exhausted, take another shipping point 
and do the same, first filling any requirements that were only 
partially satisfied by the preceding shipping point. Using 
this procedure we can proceed as follows, Take the 8 cars at 
83 and put 3 at Dl at $1 each; put 3 of the remaining at D3 

at #4 each; and the other 2 at D3 at $7 each. ۶111 8 
remaining requirements with 2 cars from 52, and put the remain- 
ing 2 from S2 at D2 (both Dl and D5 cost less but are already 
full from the shipment from 353). Finally, complete the table 
by placing the 9 cars from Sl, 3 at D2 and 6 at D4. We then 
have Table C, at a cost of 4179, which is considerably better 
than our random solution of Table B, and therefore offers us 
fewer steps to go through to achieve the optimum solution. 


(c) D1 D2 D3 D4 05 

81 3 6 9 

s2 2 2 4 

S3 3 2 3 8 
3 5 4 6 3 


It is next necessary to build up a type of cost table. 
First, fill in a table showing the actual shipping costs, 
taken from Table A, for those shipments which are actually in 
use ln Table C. This gives Table D. We can refer to the 
"squares" of the table as "square S1D1" etc., and in Table D, 
5124 is equal to $9, or S1D4 = 9. 


(D) m D2 D3 D4 25 
S1 20 g 

S2 10 8 

55 1 7 4 


second, in building up the cost table, of which Table 
D is the first step, assign row and column values on an 
arbitrary basis; row and column values are the values shown 
under those headings in Table E. They are obtained by assigning 
a value to row S1, such as zero, or l, or 2, or anything (we 
have chosen zero) and then under every square of row 91 which 
contains a rate (S1D2, S1D4, etc.) assign a column value such 
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that the sum of the row and the column values equals the sum 
of the value in the table. Thus, we obtain Table E, in which 
S1D2 is 20, and since the row value was zero, the column value 
must be 20; S1D4 is 9 and therefore the column value under D4 
must be 9 so that the row value zero and the column value 9 
equal the square S1D4. It should be noted thet both positive 
and negative values of row and column values may occur. In 
observing 22D2 we see that we already have a column value of 
20, and that the square value is 10, and therefore the row 
value for S2 must be -10. We then proceed to determine the 
column value for D3, since we have a row value of -10 and a 
square value of 8 for 82D5; the column value is necessarily 18. 
Table E is completed by similar steps to give the values shown. 


(E) Di De De DA D5 Row Values 
51 20 9 0 

S2 10 8 ~10 

55 1 T 4 -11 
Column Q 

Values 12 20 18 C 15 


Finally, we make Table E into a cost table, Table F, 
by filling in all the blank squares with the sum of the 
developed row and column values as they are shown in Table E. 
S1D1 then becomes 12, the sum of the Sl value of Zero and the 
Dl value of 12, etc. 


(F) Dl D2 D3 D4 25 Row Values 
S1 12 20 18 9 15 0 

52 2 10 8 -1 5 -10 

S5 1 9 3 -2 4 -11 
Column 12 20 1$ 9 15 

Values 


As one develops familiarity with the method, he omits 
the majority of the tables which have been added for 
instructional purposes. All that is needed is a rate table, 
Table A; a route table, Table C; and a cost table, Tabie F, 
for the first stage of the method. 


It is now possible to determine what change in our 
route of Table C should be made to reduce our shipping cost. 
Examine the table in comparison to the rate table, and find 
the square where the figure in Table F is larger by the 
greatest amount than the figure in Table A. By inspection 
this is square 51D3 in which we find a value of 18 compared 
to Table A's value of 5. We now know that a change in the 
route so that more cars go from S1 to D3 we will save $13 on 
each additional car that we can put on this route. A +86 
explanation of the reason for this statement may be found in 
Henderson and Schlaifer's article. The problem is now to find 
out how many cars we can shift to square D1D3, 1.e., how many 
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cars we can send from shipping point Sl to destination D3 
instead of using the solution of Table C. We do so as follows. 
Construct Table G, and in square S1D3 write +x: this is the 
unknown amount which will be shipped from 51 to D3. The other 
amounts in Table G are obtained by copying Table C. 


(G) D1 22 D3 D4 DS 

$1 $ +X 6% 9 

S2 2 2 4 

87 2 2 3% 8 
2 5 4 6 > 


But we must know which routes are unaffected if we are 
to complete the table properly. We mark those squares in 
Table G, after copying the Table C values, with an * or some 
mark, which is the only number in either its row or its column. 
The +x in 51D3 is counted as a number for this purpose. Hence 
we mark the value S53D1 with an *, since 3 is the only number in 
the row Dl; and we mark the 6 of S1D4 aná the 3 of S3D5 with 
an *. We next consider the * numbers as not in the table, and 
go through the * procedure again, looking for any numbers which 
are now alone ín either rows or columns. Thus we * square 
S3D3, where the 2 is the only number meeting the requirement 
of being alone, in this case in row 53. We cannot * any more 
numbers. We now have Table H, except that the only square with 
an x value in it is the +x in S1D3. We complete Table H by 
disregarding all the * values and filling in +x and -x values 
such that the rows and columns add up properly to the totals 
shown, £1D2 must necessarily become 3-x so that the sum of the 
three values of row £1 (3-x, +x, 6) add up to the known total 9. 


(H) DL D2 D3 D4 D5 

S1 و‎ +X 6% 9 

52 2+X 2-X 4 

5 4 2* مډ‎ 8 
2 5 E 6 5 


The maximum number of cars we can divert to 51D5 is 
then 2, for if we diverted any more we should have & minus 
number of cars for square 52D3, where our new value is c-x; 
one of the postulates of our problem is of course that we do 
not want negative shipments, i.¢., shipments from destinations 
to shipping points. This restraint is normally referred to as 
a "nonenegativity" restraint. By substituting the value of 2 
for x, we obtain Table J (Table I omitted) and can determine 
that the shipping cost is now only $154, or an improvement over 
our value of $179; and as we said, we save 2 cars at $13 each, 
or $26 over the first solution, or £179 - *26 = $154, 
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(J) 21 D2 93 DA D5 

S 1 2 6 9 

g 4 4 

33 3 2 3 8 
5 5 A 6 5 


lt is now necessary to repeat the same procedure, 
beginning with Table J, as we have just done in beginning with 
Table C. Tables K and L are intermediate steps to obtaining 
Table M. In Table K, a row value of 10 for row Sl was taken, 
and the complete table developed just as was done for Table F. 
Comparison of Table K with Table A shows that S3D2 in the 
former is higher in value by 2, and therefore that we will 
save $2 for each car that we can ship over that route if we 
make the proper adjustments in the other routes. Table L, 
developed just as was Table H, shows that we can ship 1 car 
over route 25D2. Our new routing table is then Table M, anā 
its cost is $151, or $2 less than our previous routing Table J. 


(K) D1 D2 D3 D4 D5 

51 ~1 20 5 9 2 10 

S2 -11 10 -5 ~1 ~8 0 

S5 1 22+2 7 1131 A 12 

#11 10 -5 -1 -9 

(L) D1 D2 D3 D4 D5 

S1 l-x 2+6 6% 9 

32 Aat 4 
* +X سح‎ $ 5 8 ۶ 1 
3 5 3 6 5 

(M) Dl D2 D3 D4 D5 

81 6 9 

S 4 4 

83 $ 1 1 2 8 
3 5 4 6 5 


Once more we go through the procedure, obtaining 
Tables N and O in reaching Table P, and improve our cost 
to $150. 





101 


(N) D1 D2 D3 D4 D5 

51 -l 18 5 9 2 9 

1 6~ 1 3- 10 و- 82 

S 1 20 1141 4 d. 
-10 9 -4 0 -7 

(0) D1 D2 D3 D4 D5 

Si 34x 6-x 

S2 بن‎ x=1 

95 23 1 l-x +X 3% 

(P) 21 02 D3 D4 DS 

51 4 5 

S2 4 

95 3 1 1 7 


Another trial, giving Table Q, shows that in no cage 
is there any value in Table Q which is higher than its 
corresponding square in Table A. We know then that we have, 
in Table P, reached the optimum arrangement. There remain no 
changes we can make in that table to obtain a lower cost of 
making the shipments of empty freight cars that Table A gave 
as being required. We nave given the freight car dispatcher 
the information on which to make his decision; should he 
decide, for reasons best known to him, to make & shipment 
other than Table P shows, he can compute the excess cost to 
do so, and base his decision on concrete cost data, not 
conjecture. 


(Q) D1 D2 D3 D4 05 

83 0 19 5 9 3 5 

نات 6- 0 اه 10 9- S2‏ 

$3 1 20 6 10 4 6 
5 14 0 4 -2 
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GLOSSARY 


Arithmetic mean - the simple average of a set of observations. 
It is obteined by adding a set of observations and 
dividing by the number of observations. 


Average (mean) deviation - the simple average of the absolute 
distances of the observations from the mean. 


Binomial distribution - a distribution in which there are only 
two alternatives. Sometimes referred to as a dichotomous 
distribution. For example, light bulbs are defective 
or good. 


Binomial expansion - raising the binomial to a given power. 


Binomial theorem - the theorem by means of which a binomial 
may be raised to any power without performing the 
multiplications. A binomial is an expression consisting 
of two terms connected by a plus or a minus sign. A 
binomial distribution is a distribution where there are 
only two alternatives. For exampie in sampling 
light bulbs, they are either good or bad. 


Coefficient of variation - a measure indicating the percent 
the standard Geviation is of the arithmetic mean. 


Correlation coefficient - an index measuring the extent of 
relationship between two variables. It ranges from -l 
(a perfect negative relationship) to 1 (a perfect positive 
relationship). 


Estimation - that phase of inference in which sample character- 
istics are used to provide estimates of corresponding 
population characteristics. 


Hypothesis - a statement of fact which it is desired to test. 
Usually referred to as a null hypothesis when testing a 
specified characteristic of a population. 


Inter-quartile range - the middle 50% of the observations. 


Those two values which include the middle 50% cf the 
cases in a distribution. 


Median - The middie observation when a set of data have been 
ordered or ranked. 


Mode = that observation in a distribution which occurs more 
than any other. 
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Normal curve - 2 eyrmetrical bell shaped curve possessing 
certain statistical characteristics. 


Normal distribution - often referred tO as Gaussian. A bell 
shaped curve possessing certain statistical character- 
istics. 


Parameter ~- a characteristic in a population. For exampie, 
the true proportion of defectives in a lot of 100,000 
shells. 


Poisson distribution - a skewed distribution which is 
encountered frequently in supply probiems. 


Population - sometimes referred io 298 universe. The entire 
group or set of observations in which we have an interest. 


Proportion - a measure between O and 1.00 indicating the ratio 
of the number possessing a given characteristic to the 
total number of observations. For exemple, a proportion 
of .80 means that 80% of the observations possess a 
certain characteristic. 


Random - a procedure whereby only chance influences selection. 
Not synonymous with haphazard. 


Range - the distance between the largest and smallest 
observation in a set of observations. 


Regression - a study of the predictive value of a line in 
explaining the degree of relationship between two 
variables. 


Regression coefficient ~- the average amount of change in Y 
per unit change in X. 


Sample - a part, or sub-set, of a population or universe. 
Skewness - a departure from symmetry. 


standard deviation - the most reliable measure of variation. 
Often referred to as the root-mean-square, 


Standard error of estimate - a measure of the scatter about 
the line of regression. The standard deviation of the 
observed points about the line. 


Standard error of the mean - the standard deviation of sample 
means, A measure of the variability in a sample mean. 
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Standard score - tne numper of standard deviations an observa- 
tion is from the arithmetic mean. Fer example, a 
standarg score of -2 indicates that the observation 1s 
two standsrd deviations below tne mean. 


Statistic - a characteristic ina sample. For example, the 
proportion of defectives in a sample of 100 shelis. 


Statistics - that branch of applied mathematics which deals 
with experimental deeizn and the collection, tabulation, 
analysis, interpretation, and presentation of data. 


T scale ~- a theoretical distribution that goes by the name of 
Student-T distribution, developed by W. 5. Gosset who 
published his work uncer the pen name of Student. 


Theory of least squares - mathematically computed lines may 
be passed throusn the data, or scatter voints, which are 
called regression lines, or lines of average relation- 
ship, because they reveal the typical change in the 
dependent variable Y which has, in the past, accompanied 
a given change in the variable plotted on the X axis. 
This average relationship may be determined mathematically 
by the method of least squares, 


Type i error - rejecting 2 hypothesis wen in fact it ls true. 


Type II error - accenting & hypothesis when in fact ii is 
Talse. 


Variable ~ that which does not maintain a constant value. 
The existence of variables brings forth the need for 
statistics. 





